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Abstract 


Main results . A rigorous proof that the number of comparisons of keys 
performed in the worst case by Heapsort on any array of size iV > 2 is equal 
to; 

2{N - 1) (Ig ^^ ^ + g ) - 2s2(iV) - e 2 {N) + min([lg(iV - 1)J,2) + 6 + c, 
where e, given by: 

e = l+\\g{N- 1)1 - Ig (A - 1) - 

is a function of N with the minimum value 0 and and the supremum value 


5 = 1 - Ige + Iglge ^ 0.0860713320559342, 


S 2 {N) is the sum of all digits of the binary representation of N, 62(A) is the 
exponent of 2 in the prime factorization of A, and c is a binary function on 
the set of integers dehned by: 


c = 


1 if A < 2^15^1 - 4 
0 otherwise, 


*(c)2015 Marek A. Suchenek. 
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is presented. 

An algorithm that generates worst-case input arrays of any size N > 2 
for Heapsort is offered. The algorithm has been implemented in Java, runs 
in 0(iV log iV) time, and allows for precise experimental verihcation of the 
above formula. 

Significance. The worst-case behavior of Heapsort has escaped mathe¬ 
matically precise characterization for almost hve decades now. This paper 
hlls that important gap. The exactness of the derived number of comparisons 
of keys performed by Heapsort in the worst case, as opposed to merely big- 
oh or ~ asymptotic approximation or bound, allows for direct and dehnite 
experimental verihcation of its correctness. 
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Introduction 

Many of those who could not figure it out exactly were quick to 
dismiss the importance of precisely knowing it. But there are some 
significant advantages of knowing the exact value as opposed to its 
approximation. For one, it can be conclusively verified by means 
of a direct experiment. 

Some researchers tend to believe that undergraduate Computer Science 
is not an area for intellectually challenging and interesting problems. Some 
even go as far as to dismiss pursuit of their solutions as pedagogy. But such 
a prejudice appears, well, prejudice, as there do exist questions that belong 
to undergraduate CS, yet they have been declared difficult to answer even 
by some renowned scholars. Although the deceitful simplicity of some of the 
solutions of problems that were once considered hard might prompt a skeptic 
to entrench himself in his dismissiveness, it should not puzzle those sympa¬ 
thetic to the P 7 ^ NP conjecture, one of the consequences of which stipulates 
the existence of hard to solve problems whose solutions are straightforward 
to verify as soon as their witnesses have been found. 

Take, for instance, Heapsort invented by Williams HH and enhanced by 
Floyd [ 5 . This specimen of elegance and simplicity, and a classic that has 
been taught across curricula of virtually every ABET-accredited Computer 
Science program, does belong to to undergraduate CS, yet it apparently has 
resisted attempts of some seasoned researchers to accurately characterize its 
worst-case performance. In this article, I will use rather elementary math¬ 
ematics to bring the worst-case analysis of Heapsort to the point that one 
could consider complete. 

The analysis I present here is not particularly shortj^ This does not come 
as a total surprise, taking into account almost hve decades that passed with¬ 
out its completion. Although Shaffer and Sedgewick declared long time ago 
that their paper j9] “essentially complete[d] the analysis of [Heapsort],” they 
also admitted that “there [was] another quantity that contribute[d] to the 
leading term of the running time that require[d] more intricate arguments,” 


^The rigorous proof of the main result is rather lengthy, particularly when compared 
to the short and elegant algebraic derivation in m of a similar result for the Floyd’s 
heap-construction program that constitutes the first (and faster) half of the Heapsort. 
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for which they had “little specihc information about the distribution beyond 
what [was] implied by [their] asymptotic results.” Clearly, they did not at¬ 
tempt to derive the exact formula for the number of comparisons of keys that 
Heapsort performs, which remained unknown for 49 years. 

So, here it is: for every natural number N >2, the number of comparisons 
of keys performed in the worst case by the Heapsort on any array of size N 
is equal to: 

2{N - 1) (Ig ^^ ^ + g ) - 2 s 2(A^) - e2(iV) -h min([lg(A^ - 1)J, 2) -h 6 -h c, 
where e, given by: 


e = 1 + 0 - 2%nd 0 = [Ig (iV - 1)] - Ig (iV - 1), 

is a continuous function (visualized on Figure [3^ page 95) of on the set of 
reals > 1, with the minimum value 0 and and the maximum {supremum, if 
N is restricted to integers) value 


5 = 1 - Ige + Iglge ^ 0.0860713320559342, 


S 2 {N) is the sum of all digits of the binary representation of N, 62(iV) is the 
exponent of 2 in the prime factorization of N, and c is a binary functioiij^ 
(visualized on Figure 23 page 68) on the set of integers dehned by: 


c = 


1 if iV < 2rig^l - 4 
0 otherwise. 


Moreover, if > 5 then the above formula simplihes to: 

2{N - 1) (Ig + £ ) - 2s2(iV) - e2(iV) + 8 + c. 


The method I chose for my derivation the above formula could be char¬ 
acterized mostly as a brute force approach, with some subtler inductive ar¬ 
guments without which the brute force alone would not accomplish much. 
Nevertheless, the elementary algebra involved in it seems well-worth study¬ 
ing in its own right as it also applies to other problems that are related to 
sorting and to hnite binary trees. 


^Algebraically, 1 — c = \\g{N + 4)] — [Ig N~\. 
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1. An overview 


The main subject of my analysis is the number (A) of compar¬ 

isons of keys that Williams’ vanilla Heapsort with Floyd’s improvement per¬ 
forms in the worst case while sorting an A-element array of distinct integers. 
It consists of three major parts. The goal of the hrst part is to show that the 
said worst case number of comparisons is the sum of the respective numbers 
CMSeHeap(^) ^ud for the heap-construction phase MakeHeap 

and the heap-deconstruction phase RemoveAll. The goals of the second and 
the third part are to derive the formulas for C'MSeHeap(^) ^RemoveAii()(-^)- 
Those three, once completed, yield the formula for C'Heapsort(-^)- 

The hrst part turned up the easiest of the three. Running MakeHeap 
backwards on any given heap resulted, and provably so, in an array that 
MakeHeap would turn onto H while performing the maximal possible number 
of comparisons for any array of that size. 


The second part was somewhat harder; however, it has been recently 
nailed down with a simple closed-form formula for C'MakeHeap(-^) that had a 
succinct and elegant algebraic prooQ 


The third part was considerably more complicated than the other two. A 
fairly simple strategy for generating bad cases for RemoveAll by running it 
backwards led to a straightforwar(^ if a bit tedious, derivation of a closed- 

but that lower bound 
I knew. It was 

the demonstration that the said strategy could not be beaten, which fact 
allowed me to conclude that the derived lower bound was also an upper 
bound and yielded a proof of the closed-form formula for C'Rej[oveAii()(-^)’ 
that was surprisingljj^ convoluted and resistant to simplihcations. 


form formula for a lower bound of C'RemoveAii()(^)’ 
was less than the “easy” 


upper bouncQ of C'“:^o,eAii{) (^) 


Well, there must have been a reason why, to my best knowledge, a journal- 
quality proof of the worst-case formula had not been published despite the 
fact that it had been done, even if in a somewhat rough form, for the special 


"‘See [10] for such a proof. 

^At least for those fluent with the kind of math that I am using in this paper. 

®(2iV - 1) [\g{N - 1)J - 2Lig(^-i)J+2 + 4, 

^It did look at the beginning as a simple exercise, only to turn out elusive as it kept 
evading my attempts to be formulated precisely. 
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case of inputs of the size N = — 1 some 36 years ago. After finishing 

the said proof I think I got a pretty good idea why all those who attempted 
it might have given up before bringing their efforts to a conclusive end. 

Out of several factors that made such a proof not quite a routine exercise, 
a flaw in the structure of the worst-case heaps for RemoveAll deserves a 
special mention. It turns out that the number of hereditarjj^worst-case heaps 
for RemoveAll is hnit^ (1017 to be exact), which rules out the existence of 
any greedy scheme of generating worst-case heaps for RemoveAll of arbitrary 
size (greater than 22). As a result, the proof of the said formula hangs on 
a singularity of worst-case heaps which states that if A^ = — 4 and H 

is a heap on A^ -|- 1 nodes such that the execution of i7.RemoveMax() on H 
performs the maximum + 1), over all heaps on A^ -|- 1 nodes, 

number C^emovena.x{){H) of comparisons of keys then the heap produced by 
the execution of i7.RemoveMax() is not a worst-case heapp^ Laying down 
foundations for a demonstration of the above fact was perhaps the most 
tedious task in this study. 


Here is a road map of the paper. 

Section page lA and on, lays down basic dehnitions and algebraic facts 


related to the subject matter. 


Section page W and on, introduces notation that I use in this paper, 
some of which may differ from the notation used by other authors. 

Section page W and on, describes a basic technique for constructing 
worst cases for Heapsort and its components by running them backwards. 
Methods PullDown, unFixHeap, and unRemoveMax are introduced there. 

Section page and on, presents a constructive proof that the exact 
characterization of the worst-case performance of Heapsort may be computed 
as a sum of exact characterizations of the worst-case performances of its two 


®A heap H is a hereditary worst-case heap if, and only if, it is a worst-case heap and 
il.RemoveMax() is either empty or is a hereditary worst-case heap. 

®The fact that not all worst-case heaps are hereditary worst-case heaps follows also 
from the fact that the number CjismoveKa.x()iH) of comparisons of keys that the operation 
il.RemoveMax() performs on a worst-case heap H on N nodes is not a function of N. 

^°It is a singular property, indeed, as for every N ^ —4, there is a worst-case heap 

H on N + 1 nodes with CRe„,oveMax()(-ff) = C'^moveMax() + 1) ^i^ch that il.RemoveMaxO is 
a worst-case heap. 
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phases: the heap-construction phase (for which a closed-form formula has 
been recently discovered) and the heap-deconstruction phase. This is accom¬ 
plished by demonstrating that for any heap of N nodes, method unFixHeap 
- a reverse of FixHeap - constructs an iV-element array that constitutes a 
worst-case array for the heap-construction phase. 


At this point, the only piece of information that is needed for the exact 
characterization of the worst-case performance of the entire Heapsort is a 
formula for the worst-case performance of the heap-deconstruction phase. 

Section page and on, introduces a solitaire game of Pull Downs a 
payoff of which is equal to the number of comparisons of keys that are needed 
to deconstruct a heap produced by the game. The said payoff is maximal if, 
and only if, the produced heap is a worst-case heap. The Section establishes 
some 1 — 1 correspondences between heaps produced by the game and various 
generators of the game, which provides some notational convenience needed 
in the remainder of the paper and assures consistency of the derived results. 
The above facts reduce the problem of construction of a worst-case heap 
for the heap-deconstruction phase of Heapsort to the problem of hnding a 
generator of a suitable game that yields a maximal payoff. 

Section page ^ and on, contains dehnitions and technical details of 
computations of credits for moves in the game of Pull Downs. It provides 
characterization of moves that yield maximal credits and evaluates losses of 
credits for some sequences of moves. 

Section page ^ and on, is mostly notational. It introduces the concept 
of a strategy and dehnes various forms of payoffs, including the upper-bound 
payoff, and losses of credit related to it. 


Section!^ pageand on, introduces some special strategies for the game 
of Pull Downs: a sub-optimal strategy par and a family of strategies win(A^) 
(one strategy for each N >2), each being optimal for given N. Strategy par 
is sub-optimal in that it loses 1 credit per level, relative to the upper-bound 
payoff, of the heap constructed, from the level 3 on, and is optimal for each 
complet^^ heap it constructs. It also maintains certain invariant property 
of the heaps it produces that is needed for the demonstration of optimality 
of its improvements. For each N < — 4, the strategy win(A^) improves 


iV = — 1 nodes, that is. 






M. A. Suchenek: A Complete Worst-Case Analysis of Heapsort (MS) 


9 


upon par (by the total of 1 credit) in that, unlike par, it is greedy in the 
level [IgiVj of the heap constructed, so that it postpones the 1 credit loss in 
that level until no further postponement is possibl^^ The payoffs for those 
strategies are derived there. They establish a lower bound for the worst-case 
behavior of the heap-deconstruction phase. 


Section page and on, is devoted to proofs of optimalities of strate¬ 
gies par and win(A^). The optimality of win(A^) allows me to conclude that 
the lower bound derived in Section and given by the formula for payoffs 
for win(A^) is an upper bound, too, thus yielding the sought-after charac¬ 
terization of the worst-case behavior of the heap-deconstruction phase. The 
mostly case-driven proof uses, at some point, the fact that no strategy can 
gain relative to a strategy that is optimal for complete heaps (in particular, 
relative to par and win(A^)) more than 1 credit an any level of the heaps that 
it produces. It allows strategies win(A^) that are optimal for complete heaps 
and greedy in the levels [Ig iVj of heaps they construct to collect the maximal 
payoffs and, therefore, be optimal. It also reduces considerably the number 
of cases that need to be tackled in the said proof. 


Subsection 10.1 page ^ and on, attempts to explain why the optimality 
proof I present in this paper is more complicated than one could expect it to 
bej^ It derives some intuitively simple fact^^that entail the optimality of 
win(A^), and demonstrates that they are about as difficult to prove as those 
in Section [TOl 


The remainder of the paper easily follows from the above. 


Section 11, page 91 and on, proves a closed-form formula for the worst- 


case number of comparisons of keys performed by the heap-deconstruction 
phase of Heapsort. 


Sectionpage and on, proves two closed-form formulas for the worst- 
case number of comparisons of keys performed by the Heapsort, one with 
function floor and one (mostly) without it. 


^^Until the move 2 ^^® v] _ ^ ^ 

^^The singularity of worst-case heaps of size N = — 3, indicated on page[^ 

seems to be the culprit here. 

^"‘The most basic of which is the Singularity Theorem 


10.13 


page 


worst-case heap of size N = 2 r'S'^l — 4 admits a lossless pull down. 


87 


stating that no 
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Section 13, page 96 and on, analyses the behavior of the “jumpy” term 
2 s 2 {N) + 62 (iV) in the mentioned above formulas and offers its tight upper 
bound expressed by a function that is continuous on the set of reals except 
for N = + 1 . 


Section 14, page 99 and on, comments on the origins of this article, with 


Subection 14.1, page 100 and on, comparing the presented results to those 
published in an old report by Kruskal and Weixelbaum. 


Appendix A offers illustrating examples. Appendix A.l[ page 105 and 
on, shows details of construction of a 12 -element worst-case heap for the heap- 


deconstruction phase of Heapsort in first 11 moves of strategy win(15). Ap 
pendix A.2 page 107 and on, shows details of construction of the last level of 
a 31-element worst-case heap for the heap-deconstruction phase of Heapsort 
in moves 15 through 30 of strategy win(31) 


Appendix A.3 


page 


110 and 


on, shows program-generated examples of 500-element worst-case array for 
the heap-construction phase and 500-element worst-case heap for the heap- 
deconstruction phase of Heapsort. 


Appendix B, page 112 and on, discusses hereditary worst-case heaps. Its 


findings explain why any greedy strategy must have failed why generating 
worst cases for the RemoveAll. 


2. Heaps and Heapsort - a brief review 

I am going to use extensively some standard undergraduate math of anal¬ 
ysis of algorithms in this paper. Here is a quick reminder of some basics: Igx 
is a logarithm base 2 of x; [xj is the greatest integer not greater than x; [x] 
is the least integer not less than x; thus is the greatest power of 2 not 

larger than x and is the least power of 2 not less than x; and % is the 
remainder modulo function defined for n > 1 by m%n = m — n x [^J. 

Here is an ubiquitous formula that comes handy while dealing with this 
kind of math, true for every positive integer n: 

[IgnJ = [lg(n + 1)1 - 1 , ( 1 ) 

a special case of which yields, for every positive integer n\ 


Llg(2- - 1)J = n - 1. 


( 2 ) 
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The rest of this Section contains some standard dehnitions and basic facts 
pertaining to heaps and Heapsort that those familiar with the subject may 
wish to omit and go directly to the next section on page W Unlike many 
other presentations, this one is prevailingly algebraic. 

A binary tree structure is a non-emptj|^£nite set I of positive integers, re¬ 
ferred to as the index set, that is closed under positive integer division by 2 
under convention that 1 positively-integerly divided by 2 is equal to 1. A 
finite binary tree, to which I will simply refer to as binary tree, is a function 
T whose domain is I. The elements of T (ordered pairs {i,T[i]), that is) are 
called nodes. For the sake of simplicity of presentation, I will assert that 
the said function is 1 — 1, which in more humane terms means that the tree 
in question is duplicate-free. In particular, T has the inverse function T~^. 
Thus T[i] is the value stored in T at index i and T~^\p] is the index of 
the value p in tree T. This 1 — 1-ness assertion will allow me to sometimes 
not distinguish between the nodes and their constituent indicies and values 
if it is clear from the context which of the three notions am I referring to. 
Since the sole purpose of this article is analysis of the worst-case number of 
comparisons of keys performed by the Heapsort sorting algorithm and its 
components, the above assertion does not lead to a loss of generality. More¬ 
over, since the Insert(x) operation is not used in Heapsort, so is neither 
included nor discussed in this analysis, I maj|^ assume, without loss of gen¬ 
erality, that T is a permutation on I 
with I. 


19 


that is, that the range of T coincides 


The index set of a binary tree defines the parent-children relationship 
between its nodes. A node p at index i is a parent of a node q at index j, or 


^^Allowing empty trees and heaps does not add any benefits to the presentation of this 
paper. 

^®This coincides with the usual mathematical-logical definition of binary tree (cf. [7]) 
as a set of binary sequences of length < a (where a is an ordinal number) for a < ui 
closed under truncation, taking into account that every finite binary sequence is equal to 
the binary representation of some integer with the leading 1 omitted; however, mine does 
not include the empty tree. 

use the square brackets in lieu of parentheses here because of popularity of an array 
representation of binary trees. 

^®And, usually, will. 

^®Thus the inverse T~^ of T is a permutation, too; for example, the inverse of 
[8,6,7,4,5,2,3,1] is [8,6,7,4,5,2,3,1]. 
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- in other words - a node q at index j is a child of a node p at index i if, and 
only if, 

(3) 


• I 

^=L2J. 


I i I 

An iterative application of (3), taking into account that = [fj, gives 

rise to the dehnition of the usual ancestry relation between nodes at indicies 
i (an ancestor’s index) and j (a descendant index) in terms of existence of 
k > 0 for which the equality 

>=liii w 

is satished. The equation (|^ yields the inequality 

<j<{i + 1)2^ 

which implies 

k <lg^ < k + lg(l + -)< k + 1, 

i i 


(5) 


( 6 ) 


or 


k = [Ig -J 
% 


20 


(7) 


Thus, for any i, J E I, i is the index of an ancestor of a node at index j if, 
and only if, 

( 8 ) 


• _ I ^ I 

* ’ 


Plugging ([^ into (|^ yields another, equivalent to (|^, characterization of the 
ancestry relation: 

fJ < j < (i + 1)2^-^^ *-1. (9) 


( 10 ) 


The equality (|^ has two solutions; 

jo = 2^ and ji = 2i + 1. 


The node q is called the left child of p at index i if, and only if, its index is 
jo, and is called the right child of p if, and only if, its index is ji. 


Q is satisfied then k given by is the distance, that is, the length of path, from 
i to j; note that (Q does not imply (|4 \. 
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root 


The root is defined as the node with no parent; thus 1 is the index of the 
A leaf is defined as a node without children; thus its index i satisfies 


21 


the condition 


2i ^ I and 2z + 1 ^ 


( 11 ) 


where / is the index set of the tree. 

A pgf/ip^ in a binary tree T is a sequence of indicies of T defined by 
induction: (1) is a path in T, if a is a path whose last element is i and j is 
an index in T with i = [|J then the concatenation cT'^(j) is a path in T, and 
nothing else is a path in T. 


It follows that binary representations of the indices of a binary tree pro¬ 
vide the navigation information how to get to those nodes from the root, 
with 0 meaning “go to the left child” and 1 meaning “go to the right child”, 
except for the first 1 that means “go to the root”, as it has been visualized 
on Figure [TJpage IT In that sense, each of those indices encodes the path 
(from the root) to the node at that index. Naturally, the length of such path 
is one less than the number of digits in the binary representation of i, that 
is, it is equal to 

Di=\\gi\, (12) 


where i is the index of the destination node (the last index in the said path). 
I will call Di the depth of the node at index i. Moreover, I will call a level k 
of the tree the set of all its nodes (or, sometimes, indicies thereof) that have 
depth k. Thus every node belongs to the level [IgiJ, where i is the index of 
that node. 


In particular, the (binary representation of the) largest index A^ in a 
binary tree encodes the path (from the root of the tree) to its last node. 
Because that path is a longest path in the tree, its length 


DN=[\gN\, (13) 

also the depth of the node at index N, is the depth of the entire tree. Thus 
the depth of the tree is the level number of the last non-empty level of that 
tree. 


^^Because = 0 and 0^1. 
^^From the root. 
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Given a binary tree T, the path of the largest child is defined as the path 
a (from the root) to a leaf such that each node in a, except for the root, is 
the largest child of its parent. One can easily conclude from the 1 — 1-ness 
assertion that every binary tree has the unique path of the largest child. If a 
may not go all the way down to a leaf then I will call it a path of the largest 
child. 


Each node p at index i in a binary tree T is the root of a subtree that 
consists of p and all its descendants, and - when treated as a separate entity 
- has it own index set It can be computed with a help of the formula 
([^ page 12 for the length of path from i to j as 


j(b _ I j G / and i = [ 


J 


2UgfJ 


J}, 


(14) 


where I is the index set of T and the ancestry relation between the respective 
indicies i and j is given by 


page 


12 


The height of a node p at index i in a binary tree T is defined as the 
depth of the subtree of T. 


I call a binary tree a convex binary tree if, and only if, its index set is a 
convex set of integers. Thus every convex binary tree on N nodes has the 
index set equal to 

/ = {!,...,iV}. (15) 


One can conclude from (14) that any subtree of a convex tree is convex. 

It follows from (12) and (14) that the depth Df of a subtree of a 
convex tree T of N nodes is equal to 


of = max{[lg(2l-^® j | j G / and i 


-2UgfJ 


J} 



= max{ [Ig -J I j G / and i } = 

t 2 L o f -I 

[since (|^ and ([^ page are equivalent] 

^ max{j I j G / and (g} ^ ^ jmaxj 


(16) 


i> 2 then jg not necessarily a permutation on its index set 
24Since 0 < j%2L'siJ < 2L'giJ so that 2L'giJ < 2L'8 tJ + j% 2 ^^si] < 2LigiJ+i or 
LlgfJ <lg(2L^®h +j%2LigTJ) < [lg|J +1, that is, [Ig fJ = [lg(2L'gT +j%2L'sT)J). 
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Since the maximal j E I that satishes 0 must also satisfy 

< AT < (17) 

(or otherwise it would not be maximal that satishes ([^), I conclude from 


(17) that 


I 1 jmax 1^1 -^^11 Jmax i , 

[ig < ig — < Lig +1 

% % % 


or 


Lig^J = Lig-J 


25 


(18) 


Applying (18) to (16), I obtain one of the fundamental formulas for the 


precise worst-case analysis of the heap construction phase of Heapsort 


Df = [Ig 


26 


(19) 


If the convex binary tree in question has more than 1 node then the 
number of nodes in its last level is odd if, and only if, the number of nodes 
in the entire heap is even. 

It follows from ( [TT| and (15) that i is an index of a leaf in a convex binary 
tree if, and only if, 

2i > N, (20) 

where N is the size of the index set. 

The leftmost descendant j of a node at index i in a hnite convex binary 
tree T is the hrst (leftmost) node in the last nonempty level of subtree 
By virtue of (|^ page 12, (18), and (19), its index j is given by this formula; 


j = i X 2^^^ = i X 2^»^.p| 


( 21 ) 


Every non-empty level k in any convex binary tree, except, perhaps, the 
last non-empty level, is equal to {2^, ...^2^^^ — 1}, while the last non-empty 
level [IgiVj in such a tree is equal to ,..., A^}. 


^^Note that [IgjmaxJ may be lass than [IgTVj, so that the intuitively simple argument 
that presumes their equality is invalid. 


^®Cf. [in] for a more conventional (and longer) derivation of (19l. Note that (191 is not 
a direct concludion from (Q as i does not have to be an ancestor of N. 


27 


See proof of Lemma 5.1 in |10| for a conventional derivation of the equality j = ix2 


D' 
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Moreover, I call a complete binary tree a convex binary tree whose index 
set I has the size N = 2^^^^— 1, that is, 

/ = {l,...,2Lig^J+i - 1}. (22) 


Naturally, any subtree of a complete binary tree is a complete binary tree. 
Of course, > 1 is the size of the index set of a complete heap if, and only 
if, + 1 is a power of two, that is, [lg(A^ + 1)J = |'lg(-^ + 1)1; by ([^ 
page[^ 

LlgiVj = Llg(iV + l)J-l. (23) 

Also, the largest complete binary tree of no more than N nodes has 

M = _ 1 (24) 


nodes, since is the greatest power of 2 not greater than A^ + 1 so 

that 2 bg(^+i)J is the greatest power of 2 minus 1 not greater than N. By ([^ 
page 10 and (13) page 13, the depth of such largest complete binary tree of 


no more than N nodes is 


Lig(2Ug(^+bJ _ 1)J = Lig(iv + 1)J - 1. (25) 


A heap is a convex]^ partially ordered binary tree. Partially ordered 
means that every sequence of values along any path in the tree is ordered in 
a decreasing order. 

An example of a heap is visualized on Figure 

Quite obviously, any non-empty convex binary H tree may be represented 
as a one-dimensional array whose indicies range over the index set of if, that 
is, from 1 to N, the number of nodes of the tree. 


^®Some authors use adjective complete in this context, instead. A neat definition bor¬ 
rowed from mathematical logic allows one to identify the set of indicies of a countable 
fc-ary tree with a set of positive integers closed under positive integer division by k, with 
I assumed equal to 1 if 1 < i < fc. In light of such a definition, a finite convex fc-ary tree 
T has a set of indicies I that comprises of the first n positive integers, which one could 
describe as complete, although I is in fact a convex set of integers so the adjective convex 
appears like a better descriptor of tree T. I will reserve adjective complete to binary trees 
with 2^ — 1 nodes, where Z? is a positive integer. It is worth noting that in mathematical 
logic trees were studied long before they were used in Computer Science, using a defini¬ 
tion of a complete /c-ary tree that ours is compatible with; in particular, according to that 
definition (cf. |^, p. 381), a finite complete fc-ary tree of depth D has — 1 nodes. 
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Figure 1: A heap of 12 nodes, with values and the navigation information shown. 


index 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

value 

12 

11 

7 

9 

10 

2 

3 

6 

8 

4 

5 

1 


Figure 2: Array representation of the heap of Figure 

The table in Figure shows an array that represents the heap of Figure 
[^with the indices of the array shown in the top row of the table. 

If tree H is partially ordered then its every subtree where i is in the 
index set of H, is partially ordered as well, so a subtree of a heap is a heapp^ 
I will call a subheap of H. 

Heapsort (see, e.g., j3] for its description and partial analysis) consists 
of two phases: heap construction and a sequence of removals from the con¬ 
structed heap that I call heap deconstruction. 

Both phases use a subroutine FixHeap that inherits an almost heap, de- 
hned as a heap whose root, referred to in some contexts as a patch, may 
violate the partially ordered tree condition in the dehnition of heap, and 
turns it onto a heap by bubble-sorting its root into the path of the largest 
child. This is done by demoting the said patch down the heap while pro¬ 
moting the largest of its current children until the demotee reaches the level 
where it is not less than any of its current children, if it still has any at 


^®Except that it is not necessarily a permutation on its index set. 
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that level. Since each step in that process requires comparing, directly or 
indirectly, the demotee to its all childrerp^ the total number of comparisons 
of keys that FixHeap performs during one call is equal to the total number 
of children of elements of the path of demotion that it follow^^ Because 
FixHeap is the only part of Heapsort that performs comparisons of keys, the 
above characterization is the point of departure of the analysis presented in 
this paper. 

The heap-construction phase, referred to as MakeHeap in this paper and 
credited to Floyd |1], inherits an array that represents a convex binary tree 
H and rearranges it onto a heap by calling FixHeap for its parts that repre¬ 
sent subtrees of H that have been already rearranged onto almost subheaps, 
beginning from the one that has the last non-leaf (stored at the index [yj 
in the array) of H as the rootf^ and ending with the entire tree H (the root 
of which is stored at index 1). This is accomplished by the following Java 
statement: 


for (int i = N/2; i > 0; i--) FixHeap(i); (26) 

The heap-deconstruction phase, referred to as RemoveAll in this paper, con¬ 
sists of N calls to a subroutine RemoveMax. Each of these calls removes the 
current root of the heap, patches the resulting vacancy with the current last 
node of the heap, and calls FixHeap(l) in order to turn the resulting almost 
heap onto a heap after each removal. The removed nodes are then stored 
in the array heap from the last index up in the order they were removed, 
which process yields an array that is sorted in an increasing order. This is 
accomplished by the following Java statement: 

for (int i = N; i > 0; i--) heap[i] = RemoveMaxO ; (27) 

A complete code of HeapSort may be easily found in about every standard 
text on Data Structures and Algorithms, or in jl2j . 


^°Clearly, the demotee must have been compared to its largest child; knowing which child 
is the largest does require comparing children to each other, if there are two children, that 
is. 

^^Which is a path of the largest child. 

^^MakeHeap could have begun calling FixHeap from the last node of H, but this would 
produce the same sequence of comparisons of keys and demotions because FixHeap does 
not do anything to a one-node tree. 
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3. Notation and basic facts 

I am going to measure the running time of Heapsort and its components 
by the number of comparisons of keys that they perform, using the following 
notation. 

For any operation X, C'x(F') denotes the number of comparisons of keys 
that X performs while executed on its input Y. For instance, C'FixHeap(7"*'*^), 
which I will also denote as C'FixHeap(*)(^); is the number of comparisons of keys 
that the FixHeap performs while turning the almost subheap T^) rooted at 
node i of a convex binary tree T onto a heap. 

Moreover, denotes the maximum number of comparisons of keys 

that X performs while executed on its any valid input of size N. It is given 
by this formula: 

C'^'^{N) = mayi{CxiY) \ Y is X’s valid input of size N}. (28) 

For instance, 

CRemoveAii{)(^) = max{CRe„,oveAii()(hf) \ H IS a heap on N nodes}. (29) 

Residu^^ of a heap H is either the heap that is the result of one applica¬ 
tions of RemoveMax to hf, or a residue of a residue of H. 

bj is the set of all non-negative integers, is the set of all positive 
integers. A (non-empty) sequence is a function whose domain is a convex 
subset of that contains 1. If s is a sequence and N C then s f is 
the result of restricting (of the domain of s) to N. v'^'w is the concatenation 
of sequences v and w. 

4. A useful trick: Running Heapsort backwards 

It turns out that FixHeap is invertible, and so are MakeHeap, RemoveMax, 
and RemoveAll. This fact allows for running the entire Heapsort backwards 
in order to produce inputs that force it to follow predetermined paths of de¬ 
motions within the heap. This is useful in construction of cases that establish 
lower bounds on its worst-case behavior. Although the transition relation for 


Proper residue would be a more adequate but longer term. 
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Heapsort, a deterministic algorithm, is a function, its inverse relation is not 
a function. So, some extra information is required in order to execute it 
backwards. 


The basic operation 1 will use to accomplish all the above is PullDown 
whose Java code is shown on Figure on page ^ It takes a convex binary 
tree H and two indicies in if, i and j, with j a presumed descendant of 
^j3i| removes and returns node H[j] (instructions in lines 476 and 481 of the 
referenced above Java code), and demotes j’s all proper ancestors in H that 
are descendants of i (the /or-loop in lines 478 and 479 of the Java code). 


473 

474 

475 

476 

477 

478 

479 

480 

481 

482 


□ 


// 

// 

// 


public int PullDown (int i, int j) 

Removes and returns heap[]] and demotes its all ancestors up to i 
{ 

int toReturn = heap[j]; 

Demote all ancestors of i 

for (int m = j; m/2 >= i; m /=2) 
heap[m] = heap[m/2] ; 

Done! 

return toReturn; 

} 


Figure 3: A Java code for operation PullDown(i, j); i and j are indicies. 

The following Subheap Repair Lemma provides a useful characterization 
of the implementation of the operation PullDown. 

Subheap Repair Lemma 4.1. Let H be a convex binary tree of N > nodes, 
i < L'f'-I index such that the subtree rated at i is a heap, j be a 

proper descendant index of i in H, and H' be the convex binary tree that is 
the result of executing the following Java instruction 

H.heapEj] = H.PullDownd, j) ; (30) 

on H, where the Java code of method PullDown is visualized on Figure^ 

(i) The execution o/FixHeap on H' yields H, that is, 

ifkFixHeap(i) = H. (31) 


actually works fine even if j is not e descendant of i; in such a case, it works as if i 
were equal to 1. 
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(ii) The number CFixHeap(i) (-^0 comparisons of keys performed by the 
execution o/FixHeap on H' is given by this equality: 

= 2([lgiJ - [IgiJ - 1) + + #7, (32) 

0 if 2m > N 

1 if 2m = N (33) 

2 if 2m < N 

is the number of children of the node at index m in a heap of N ele¬ 
ments. 

(iii) If J is the leftmost descendant of i in H, and, therefore, in then 
the nnm6er CFixHeap(i) of comparisons of keys performed by the ex¬ 
ecution o/FixHeap on H' is maximal, that is, it satisfies the equality 

C^FixHeap«(^^') = C^^HeapW (A') • (34) 

Proof Because is a heap and j is a proper descendant of i in if and, 
therefore, in ifh)^ 

m < H\i]. (35) 

Moreover, for every child k of j in H and, therefore, in H^'^\ 


if 


35 


= 


H[k] < H\j]. 


(36) 


Since, as an effect of instruction (30), H'[i\ = H[j], H'[i] = H[j], inequality 


(35) implies 


H'\j] > H'[f\, 


(37) 


and inequality (36) implies, for every child k of j (unaffected by instruction 

(38) 


(30)) in in H' and, therefore, in if4*)^ 

H'[k] < H'[i\. 


^^4fm = signum(^ — 2) + 1, where signum{x) is the sign of x (-1 if x < 0, 0 if a: = 0, 
and 1 if X > 0). 
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Each node demoted as a result of H.PullDown(i, j) in instruction (30) 
becomes the largest child after the demotion, so that the path a = {i, 
of such demotions becomes a path of the largest child in the subtree 
of H'. This is a consequence of the fact that each demotee was the parent 
(before the demotion) of its sibling (after the demotion), except for the case 
when the demotee became a leaf in if', thus it must be greater than its sibling 
(after the demotion), if it has any. Thus the demotions of H'[i] done by the 
subsequent execution of FixHeap() will follow a and terminate after H'[i] is 
demoted to index j since all children of j are, by virtue of (38), less than the 
demotee As a result, (31) holds. This completes the proof of [(I)| 


The number of comparisons of keys performed by FixHeap() following a 
path a of the largest child is equal to the total number of children of indicies 
in a. Thus, 


CFixHeap(*)(^f') = X] + #fij + #f • (39) 

A:ecr\{[|JJ} 


Since all elements of a, except, perhaps, for [|J and j, have 2 children each, 
X]fceo-\{[ij is equal to twice the number of levels between the level of i 

and the level of [|J (not including the latter), that is, 

= 2(LigL^JJ-Lig*J) = 2(Lig^J-Lig^J) = 2(LigjJ-Lig*J-i)- 


Thus, 


Y1 = 2([lgjJ - [IgiJ - 1). 


(40) 


11 . 


Substituting (|40|) to the right-hand side of (|39|) yields (|32|). This completes 
the proof of 


Clearly, the path from i to its leftmost descendant j has the total number 
of children in the subtree if'h) of H' at least as large as any other path in 
Pas Therefore, the equality (34) holds. This completes the proof of 
and the proof of the Lemma. 


Ill 


□ 


Note. Since FixHeap(i) performs the maximum number of comparisons 
if it demotes i to the index of its leftmost descendant, it may be computed 


directly from the equality (32) page 21, substituting j = i x 2^8 C given 



















M. A. Suchenek: A Complete Worst-Case Analysis of Heapsort (MS) 23 


by the equality (21) page 15 and using the equality (34) of part (hi) of the 
Subheap Repair Lemma |4.1 that 


'^S=..p(i)(A') = LigTJ + L>g^J. 

0 L 


(41) 


but since I already did it in HD], Corollary 5.2, I refrained from redoing it 

(42) 


here. Also, substituting 1 for i in the equality (41) yields 


C^FSHeap(l)(^)= + 


For running backwards the heap-construction phase MakeHeap, I will use 
an operation i/.unFixHeap(z) that takes a subheap rooted at node i < 
[yj of a convex binary tree if of nodes and turns it onto an almost subheap 
if'*^*) of the resulting convex binary tree H' = ii.unFixHeap(i), while leaving 
the remainder of H unchanged, with the following two constrains satished; 


(ii.unFixHeap(i)).FixHeap(i) = H (43) 


and 

C'FixHeap{i)(hf.UnFixHeap(*)) = C'“[^Heap{i)(^)i (44) 

where C'FixHeap(*)(R) is the number of comparisons of keys that the FixHeap 
performs while turning the almost subheap rooted at node i < [yj of a 
convex binary tree T onto a heap, and C'FixHeap(i)(-^) maximum of 

CFixHeap(j)(R) over all convex binary trees T oi N nodes whose subtrees rooted 
at index i are almost heaps. 


Operation unFixHeap is not unique. Any one that satishes the constrains 
(43) and (44) will do. I am going to prove that the Java code shown on 
Figure 1^ implements operation unFixHeap that satishes those constrains. 

Let if be a convex binary tree of N nodes whose subtree if^*^ roted at 
index i < [yj of ii is a heap, and let j, given by ( pT| page 15 and computed 
by the instruction in line 489 of Java code shown on Figure ^be the leftmost 
descendant of i. Since j is a leaf and i is not, j is a proper descendant of z, 
so that Subheap Repair Lemma |4.1| does apply. 

Application of Sub heap Repair Lemma [4A||(i)| yields ( [4^ . Thus constrain 
(43) is satished. 
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484 


485 

// 

486 E 

3 

487' 

// 

488 

// 

489 


490 

// 

491 

// 

492 


493 

// 

494 


495 



public Heap unFixHeap(int i) 

Unfix heap's subheap rooted at i onto a worst-to-fix almost subheap 

{ 

Find the leftmost leaf j under i 

MyMath.twoToFloorLg(N/i) is the largest power of 2 not larger than N/i 
int j = i’^MyMath.twoToFloorLg(N/i); 

Demote all ancestors of j up to i, including i 
Make the former leaf j the root of the almost subheap 
heap[i] = PullDown(i,j); 

Done! 

return this; 

} 


Figure 4: A Java code that implements operation unFixHeap. 


Instruction in line 489 
The static 


computes the index of the leftmost descendant of i using the formula (211. 
method twoToFloorLg(n) in class MyMath computes The method PullDown(i, j) 

removes and returns the node at index j and demotes its all ancestors up to and including 
an ancestor at index i; it is shown on Fig. [^on page and is discussed in Section 


Application of Subheap Repair Lemma 4.1 (iii) yields (44). Thus con¬ 


strain (44) is satished. 


For running backwards the heap-deconstruction phase RemoveAll, I will 
use the operation iL.unRemoveMax(t), visualized on Figure that undoes 
the effects of any given operation RemoveMax that produced a given heap 
H. More specihcally, iL.unRemoveMax(t) takes a heap H on N nodes and an 
index i of its node H[i] [^that satishes the constrain 


N +1 

m < 


(45) 


and produces a[^ heap 


H' = iL.unRemoveMax(t) 


(46) 


on Af J- 1 nodes that satishes these two constrains: 


H'[N + 1] = H[t], 


(47) 


^®The presumed patch used by FixHeap(l) called by the RemoveMax() that the 

unRemoveMax(i) presumably undoes^ _ _ 

The, as the Uniqueness Lemma 4.2 page 25 states. 
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4971 

498 □ 

499 

500 

501 

502 

503 

504 

505 

506 
507' 

508 

509 

510 L 


// 

// 

// 

// 


// 


public Heap unRemoveMax(int i) 

{ 

Finds the last node w/o right child 
int p = (N + l)/2; 

Checks if it can become the parent of i 
if (heap[p] < heap[i]) return null; 

Demotes all ancestors of j and 

makes the former leaf j the root of the almost subheap 
N++; 

heap[N] = PullDown(l,i); 
heap[l] = N; 

Done! 

return this; 

} 


Figure 5: A Java code that implements operation urLRemoveMax(z). The method 
PullDown(l, i) removes and returns the node at index j and demotes its all ancestors; 
it is shown on Fig. on page [20| and is discussed in Section 


and 


i7'.RemoveMax() = H. 


(48) 


The constrains (47) and (48) are self-explanatory. The constrain (45) 


is an input constrain for unRemoveMax(t) and needs a comment. It allows 
unRemoveMax(t), line 506 of the Java code visualized on Figure]^ to make 
H[i] a child, in heap H', of the hrst node of heap H without the 


right child. This, for any H and i that satisfy the constrain ( |45[ ), assures the 
existence of H' that satishes constrains (47) and (48). If 77 was produced by 


RemoveMax() that used H[i\ as the patch then, of course, (45) is satished. In 
the case of i (the said node without the right child is the patch), 

this constrain reduces to a tautology H [L^J] < In the case 

i = -|- (A^ mod 2) (the left child of the said node without the 

right child is the patch), N is even and the constrain reduces to H[N] < 77[y], 
true for every heap H on N nodes. These, for any 77, assure the existence of 


index i in 77 that satishes the constrain (45) 


The i and 77', whose existences have been demonstrated above, are unique, 
as the following Lemma states. 

The Uniqueness Lemma 4.2. . 

(i) For every heap H on N nodes and its every index i that satisfies the 


and (4-8). 


constrain (45), there is the unique heap 77' that satisfies constrains (45) 
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For every heap H' on N > 2 nodes there is a unique index i < N — 1 in 


the heap H defined by {4S) such that the constrain (45) and the equality 


(46) are satisfied. 


Proof, (i) The existence of such a heap H' follows form the foregoing dis¬ 


cussion. Now, suppose that heaps on iV -|- 1 nodes H' and its substitute G 


satisfy (46) and (48). Let a be the path (from the root) to i. Obviously, a is 
a path in all three heaps, Lf, if', and G. Since RemoveMax() does not modify 
its explicit argument [FF or G) except for the elements at indices along path 
(T, I infer that for every index k in H with k ^ a, 


that is. 


H'[k] = H[k] = G[k], 


H'[k] =G[k( 


Also, by virtue of constrain (47) 


H'[N + 1] = GiiNT 1]. 

Let j G a, that is, j > 1 be an ancestor of i. We have: 


(49) 

(60) 


Also 


H'\j] = H\j/2] = G[j-|. 


H'\l] = Af + 1 = G|l|. 


Thus the equality (49) is also satished for all i E a. Therefore, 

H' = G. 


(51) 

(52) 


Hence the uniqueness of H'. 

(ii) The existence of such an i follows from the foregoing discussion. The 
uniqueness follows from part (i) of the Lemma (already proved) and constrain 


□ 

The above Uniqueness Lemma assures that the heap H' postulated by 
constrains (47) and (48) does exist and is unique. I am going to show that 
the Java code visualized on Figure [^produces an J- 1-node heap H' out of 


^®Recall that any heap is a 1-1 function. 
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given A^-node heap H that satisfy constrains (47) and (48) if the constrain 
(45) is met; otherwise the move i is invalid and unRemoveMax(«) returns the 
null value as a result of execution of instructions in lines 500 and 502 of the 
Java code of Figure page 2^ Without loss of generalitjQ I will use an 
example of implementation of RemoveMax() whose Java code is visualized on 
Figure 


50 

51 □ 

52 

53 

54 

55 

56 

57 

58 

59 L 


public int RemoveMaxO 
{ 

if (N==0) return -9999; 

//here count != 0 

int maxitem = heap[l]; //remove the root and 
heap[l] = heap[N]; //fill the vacancy with the last 
N--; 

FixHeap(l); 
return maxitem; 


} 


Figure 6: A Java code that implements operation RemoveMax(). The method FixHeap(l) 
implements the operation described at the end of Section 


H' is a heap because H is a heap, and instruction in line 507 of the Java 
code of Figure page ^ makes its root larger than any other node of H', 
and instruction in line 506 of that code attaches a new child to the node p at 
index N that, by asserted constrain (45), is less than p, and the rest of that 
code does not affect the ordering of the paths in H'. 


Constrain (47) is secured by instructions in lines 505 and 506 of the above 
referenced Java code. In order to show that constrain (48) is met, let us look 
into the implementation of operation RemoveMax of Figure]^ 


Clearly, instructions in lines 55 and 56 of the Java code is shown on 
Figure [^reverse the effects of the instructions in lines 505 and 503, as well as 
the effects of the assignment to heap[N] in line 506 of the Java code is shown 
on Figure At this point, the resulting tree is the same as if instruction 


H.heapEl] = H.PullDownd , i) ; (53) 


were applied to the original heap. Therefore, the Subheap Repair Lemma 
4.1 applies (substituting 1 for i and i for j), and its part |(i)| implies that line 


^®Since for every heap H, the result of execution of RemoveMax() on H is determined by 
H, any two correct implementations of RemoveMax() are functionally equivalent. 
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57 of the Java code is shown on Figure restores the original heap. Thus 
constrain (48) is satisfied. As a result, RemoveMax() and unRemoveMax(i) are 


inverses to one another, as the following Lemma states. 

The RemoveMax Invertibility Lemma 4.3. . 

(i) For every heap H on N nodes and every index i < N that satisfies the 


constrain (45) page\^ 

(/7.unRemoveMax(«)).RemoveMax() = H. (54) 

(ii) For every heap H on N > 2 nodes there is a unique index i < N — 1 


such that the constrain (45) page 24 is satisfied and 


(7/.RemoveMax()).unRemoveMax(i) = H. 


(55) 


Proof. Let H he a heap on N nodes and FF be a heap defined by ( [4^ 
page|^ Substituting (46) to (48) page yields 


11 


Let FF be a heap on > 2 nodes, H be the heap defined by (48) 


and i be the unique index whose existence is assured by the Uniqueness 
Lemma 4.2| (ii) Combining (46) and (48), both of which are satisfied, yields 


H' = i7.unRenioveMax(i) = (i7'.RemoveMax()).unRenioveMax(i), (56) 


or 


H' = (i7'.RemoveMax()).unRemoveMax(i) 


Since FF was any heap on N >2 nodes, (57) implies (55). 


(57) 

□ 


The following Lemma will be useful while proving correctness of construc¬ 
tion of worst-case heaps for RemoveAll. 

The RemoveMax Cost Lemma 4.4. Let H be a heap of N > 2 nodes and 
let H' = i7.unRemoveMax(i), where 2 < i < N. The number CRemoveMaxO (-ff0 
of comparisons of keys that the operation RemoveMax() performs on H' is 
given by this equality: 


CRemoveMaxO (-ff 0 ” 2([lgiJ — 1) + , 


(58) 


where given by (33) page 21, is the number of children of the node at 


index m in a heap of N elements, with convention = 0. 




















M. A. Suchenek: A Complete Worst-Case Analysis of Heapsort (MS) 29 


Proof. The only comparisons of keys within RemoveMax() are performed by 
the call to FixHeap(l) in line 57 of the Java code visualized on Figure 
Thus 

C*RemoveMax() ^FixHeap(l) (77"), ( 59 ) 

where 77" is the heap on N nodes with 77" [fc] = 77'[fc] for 2 < k < N and 
77"[1] = 77'[iV+l]. Since, as noted in the discussion above, 77.unRemoveMax(i) 


performed actions that were comprised by (53), the Subheap Repair Lemma 


4.1 


11 


applies (substituting 1 for i and i for j) and the equality of its part 
taking into account equality 


yields 


□ 

Note. Equalities (|59|) and (|4^ page|^(the latter with substituting N—1 


for N since the almost heap sent to FixHeap(l) has one less node than the 
one sent to RemoveMax) yield for every N > 3: 


^max 

'^RemoveMcLx 


„(]V)=Llg(A'-l)J + Llg(A'-2)J. 


(60) 


5. Decomposition of the worst-case analysis of Heapsort 

It is easy to generate worst-case input arrays of arbitrary size for MakeHeap. 
A Java code presented and proved correct in Appendix A of m does just 
that. However, efficientljj^ generating worst-case heaps for RemoveAll and 
worst-case input arrays for the entire Heapsort, except for some special sizes, 
have been, to my best knowledge, unknown. Moreover, although the sum of 
upper bounds of program’s components is an upper bound on the running 
time of the entire program, the converse is not necessarily true. For instance, 
for > 13 an upper bound on the number comparisons of keys performed 
by RemoveAll on an A^-element heap is less than the sum of any upper 
bounds on numbers of comparisons of keys performed by the sequence of N 
RemoveMaxes that comprise it. And this singularity^] is one of the reasons 
why the worst-case analysis of Heapsort is not totally a routine task. 

Fortunately, the problem of generation of worst-case inputs for Heapsort 
can be decomposed on two subproblems; how to, given any > 1, generate 
a worst-case heap 77 of size N for RemoveAll, and how to, given a heap 77, 
generate a worst-case array for MakeHeap that MakeHeap converts onto 77. 
Clearly, solving those subproblems in that order will result in a worst-case 


opposed to, say, exhaustive search. 

^^This singularity is characterized by Theorems 


10.13 


and 


10.14 


page 


87 
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array A of size N for Heapsort. Careful evaluation of the number C'Heapsort(^) 
of comparison of keys performed on the resulting array will allow me to derive 
the exact formula for 

The mentioned above decomposition property implies that any upper 
bound on the number comparisons of keys performed by Heapsort on an N- 
element array is equal to the sum of upper bounds on numbers of comparisons 
of keys performed by that MakeHeap and RemoveAll on an A^-element array 
and an A^-element heap, respectively. In particular, the least upper bound is 
equal to such a sum, thus yielding 

Cr.;.or^iN) = C—Heap(A^) + ( 61 ) 


Based on the above observations, in order to prove (61) it suffices to 
demonstrate that, given a heap H, the Java loop statement visualized on 
Figure constructs a worst-case input array for the MakeHeap, that is, an 
array that the MakeHeap will convert onto H while performing the maximum 
possible number of comparisons. The equalities (31) and (34) of the Subheap 
Repair Lemma 4.1|(i)|and (iii), page 21, via constrains (43) and (44), page 23 


they entail, make the latter a routine exercise. 


450 


for (int i = 1; i <= N/2; i++) unFixHeap(i); 


Figure 7: A Java statement that constructs a worst-case input array for MaikeHeap given 
an output heap. The method unFixHeap(i) is shown on Fig. 


Figure 1^ shows an output generated by my Java program containing the 
above code. 

First, I will prove, by induction on iF, that for every K E { [yj —1,..., iV}, 
that the following program Pk'- 

for (int i = N/2; i <= K; i++) unFixHeap(i); (62) 

for (int i = K; i >= N/2; i--) FixHeap(i); (63) 

leaves the heap H, on which it is run, unchanged, that is. 


H = H\ 


( 64 ) 
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12-eT.emQnt heap iGyeT.-by-T. 0 vel.: 


Level 0 
Level 1 
Level 2 
Level 3 


12 


MakeWorstArray: 6 9 1 11 5 2 3 12 8 10 4 7 

Heapsort: heap constructed in 18 comps and 22 moves 
Heapsort: theoretic worst-case n. of comps = 18 
Heapsort: theoretic worst-case n. of moves = 22 

Heapsort: array sorted in 59 comps 

Heapsort: Theoretic worst-case number of comps = 59 


Figure 8: A 12-element worst-case array [6, 9, !> H) 5; 2, 3,12, 8,10,4) 7] for the heap of 
Fig .[^on page[^(a worst-case heap for the deconstruction phase of Heapsort); a fragment 
of output generated by my Java program containing the code of Fig. [^on page[30| 


where H' is the value of H after the execution of Pk on it, and that the line 


(63) in Pk forces the FixHeap to perform the total of 


K 

C(N, K) = ^ 

*=LfJ 


(TV) 


(65) 


comparisons of keys. 

It K = [yj —1 then program Pk performs no actions, so that the invariant 
(64) is satisfied, and the numbers of comparisons done by the FixHeap is 0, 
which is equal to the right-hand side of (65). 

It K G {LyJ: fhen program Pk is functionally equivalent to: 


for (int i = N/2; i <= K-1; i++) unFixHeap(i); (66) 

unFixHeap(K); (67) 

FixHeap(K); (68) 

for (int i = K-1; i >= N/2; i--) FixHeap(i); (69) 


By (43) page 23, line (68) cancels out the effects of line (67), and by the 


invariant (64) of the inductive hypothesis, line (69) cancels out the effects of 
line (66). So, program Pk leaves the heap H it is run on unchanged and the 
invariant (64) is satisfied. 
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By (44) page 23, putting i = K, the number of comparisons of keys 


performed by FixHeap in line (68) is equal to 

^Fi^Heap{iC)(^)) 


(70) 


and by the inductive hypothesis, taking into account that line (68) cancels 
out the effects of line (67), the number of comparisons of keys performed by 
FixHeap in line (69) is equal to 


K-l 

<-L#J 


'max 

FixHeap(i) 


(A'). 


(71) 


SO that 


C(lV,A') = C-S,..rtK)(lV) + 


K-l 

E 

-LfJ 


^max 

^FixHeap(2) 


(N), 


or (65). 


K 


C(N,K) = 5^ C;“„..p(.)(A'). 


(72) 




This completes the inductive proof of (65) and the invariant (64). 

Since for every N >2, C{N, N) is a lower bound on the number C, 
of comparisons that MakeHeap performs on any iV-element array and 

/^max 

FixHeap(i) 


max 
MakeHeap 


(iV) 


(N) is an upper bound on Cj 


max 

MakeHeap 


(N) , I conclude from 


(72), putting K = N, that for every N > 2, C{N,N) is the least upper 


bound on C'fekeHeap(^)i that is. 


C{N, N) = C, 


MakeHeap 


(IV). 


(73) 


Thus, for every heap H, the Java loop statement visualized on Figure 
constructs a worst-case input array for the MakeHeap that MakeHeap converts 
onto T/ru Hence, the equality (61) holds. 


By virtue of Theorem 7.1 in uni, 


^max 

'^MeikeHeap 


{N) = 2N-2s2{N)-e2{N), 


other words, every heap is a worst-case heap to build for the MakeHeap. 


( 74 ) 
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Figure 9: The number of comparisons of keys performed by MakeHeap in the worst case 
on any fV-element array, also known as the Sloan sequence A092054 (cf. [T]). The lower 
line visualizes the right-hand side of (721 for K = [iVJ, while the upper line visualizes the 
linear interpolation of the right-hand side of (74) between integer points. 


where S 2 {N) is the sum of all binary digits of N and e 2 {N) is the exponent of 2 
in the prime factorization of N. Thus the array A produced by the Java loop 
statement visualized on Figure run on any heap H of N distinct elements, 
when given as the input to MakeHeap forces it to perform 2N— 2s2{N) — e 2 {N) 
comparisons of keys and to yield the heap H. 

Note. The sequence of integers 


1,2,4, 6, 7, 8,11,14,15,16,18, 20, 21, 22, 26, 30, 31,32, 34, 36, 37,... 


given by the formula ( |74| ) and visualized on Figure appears as the Sloan 
sequence A092054 in [1]. 

At this point it becomes clear that all that is needed for the completion 
of the worst-case analysis of Heapsort is a derivation of a formula for the 
worst-case number of comparisons of keys performed by the RemoveAll. I 
will do just that in the sequel of this paper. 


6. The game of Pull Downs 

Both Heapsort and MakeHeap have nice worst-case decomposition prop¬ 
erties that allow to compute their worst-case numbers of comparisons of keys 
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as sums of worst cases of their components. Unfortunately, the same cannot 
be said of RemoveAll. Although, for arrays of size less than or equal to 12, 
the worst-case number of comparisons of keys performed by RemoveAll is 
equal to the sum of the worst-case numbers of comparisons of keys of the 
RemoveMax’s that make RemoveAll, it is not the case for arrays of more than 
12 element^^ In the latter case, the worst-case number of comparisons of 
keys is performed by RemoveAll is always less than the sum of the worst-case 
numbers of comparisons of keys of the RemoveMax’s that make RemoveAll. 

In order to construct worst-case heaps for RemoveAll and prove that they 
actually are worst-case heaps, I will resort to games and strategies. 


n E u) of heaps in which Hi is the one- 
^ and for every positive integer i, heap 


Imagine the following solitaire game played by the Player 

A game is a sequence H = (H„+i 
element heap whose only node is 1, 

is the result of a valid application Hj.unRemoveMax(ki) of the operation 
unRemoveMax(ki), visualized on Figure [^page 25, to heap H*. In particular, 
each Hj is a heap on i nodes. Each application Hj.unRemoveMax(ki) is an Ath 
move, and I will refer to it in some contexts as a pull down'^^^ if it is a valid 
application, that is, if the instance of the constrain (45) is 

satished. 


The Player draws at random an integer N > 2 and executes a sequence 
of — 1 consecutiv^^ pull downs on the 1-element heap Hi. The result 
is an A^-element heap Hjv. Player’s goal is to maximize the payoff for the 
game dehned as the number of comparisons of keys that the 

Heapsort’s reconstruction phase RemoveAll will perform while run on the 
heap Hat. 

Given a game H, its every Ath move Hj.unRemoveMax(kj) is determined by 


5|page 

29 

IT is (1, 



see 


footnote 


20 


is part of 


^•^As I have indicated at the begining of Section 
^"^Formally, Hi = {(1,1)}, so the only node of h 
its value 1 (the second element in the pair). 

call PullDown(l, i) to method PullDown, visualized on Figure]^ page 
method unRemoveMax; hence the name pull down. 

^®The reason for using both moves and pull downs is to simplify notation. For instance, 
it allows using pull down v (on a heap H) in lieu of move H~^[v]. 

The adjective consecutive in this context means that each next pull down of the said 
sequence is applied to the heap produced by the foregoing pull down. 
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the index kj, and the game H itself is determined by the sequence 
k = (k„+i I n G cu) of indices that are the arguments of respective 
Hj.unRemoveMax(kj)’s. Thus there is a 1 — 1 function (^(k) between the se¬ 
quences k of valid moves and games that is dehned by: 


^(k) = H. 


(75) 


Because of that, I will identify moves Hj.unRemoveMax(kj) with their argu¬ 
ments kj. Also, since I assumed (on page 11) that any heap hf is a 1 — 1 


function, and as such has the inverse H~^, any index fc in is unambigu¬ 
ously identihed by the node H[k] that is stored at k. Thus the game H is 
also determined by the sequence 

V = H(k)|^= (Hi+i[kj+i] I i e (u) (76) 

of patches where kj is the argument of respective Hj.unRemoveMax(kj), simply 


because the sequence k in (75) is determined by 


k = H ‘(v) = (H 'i+ilvi+i] lieu). 


(77) 


This fact allows me to identify pull downs with patches (the values that are 
being pulled down) rather than with their indices. For instance, given a heap 
H, pull down 6 is the move i7.unRemoveMax(i7“^[6]). 


I will also consider hnite subgames H jk of game H that I dehne as hnite 
sequences of heaps (Hj | J < i < K) from H. Clearly, any hnite subgame 
Hjk is determined by the heap Hj and the subsequence kjic_i = (kj | J < 
i < K — 1) oi k = (^“^(H) of valid moves. This gives rise to function 
dehned by 




19 


(78) 


As before, given heap H j, any hnite subgame V\jk is unambiguously deter¬ 
mined by the corresponding subsequence 


= H(kj_i^_i) 


H(k)j,^_i = (H,[k,]| J<^<iF-l) 


(79) 


^^Incorporating (75), H(k) may be written as H(i^“^(H)) that yields v directly from H 
without any references to k. 

^®The K — J moves result in a if — J + 1-element subgame: the original heap Hj plus 
the K — J heaps Hj+i,..., V\k created by that subgame. 
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of pull downs applied consecutivelj^ to Hj, simply because the sequence 
kj^K-i in (78) is determined by 

kj,K-i = = (Hri[k,] \J<t<K-l). (80) 


For example, if the subsequence vi^g of pnll downs is equal to (1,1,1,1, 2,1) 
then the corresponding subsequence ki^g of moves is equal to: 

k,,6 = {Hr‘|l],Hj'[l],H3'|l],H,-‘|l],Hy[2],Hj'|l]) = (1,2,3,4,4,5). (81) 


Given heap Hj, each of these subsequences will serve as a dehnition of 
the last heap V\k in the subgame Hji^. For instance, one can verify (a 
program can do it) that given the 1-element heap Hi, the sequence of pull 
downs (1,1,1,1, 2,1), and the sequence of moves (1, 2, 3,4,4,5), dehne the 
7-element heap Hy visualized on Figure [TTj 

Since RemoveMax is a functionp^ pull downs of two different patches v 
and w (for instance, pull down 1 and pull down 2) produce different heaps 
regardless whether they were applied to the same heap or not. For if both 
pull down V and pull down w produce the same heap 77, then the heap 
G that they were applied to is given by 77.RemoveMax() and, therefore, is 
unique, and so is the patch that was used to £11 the vacancy after the largest 
element of 77 was removed. Thus, v = w. Hence, different sequences yj^x-i 
of consecutive pull downs are never coalescin^^ in that they always produce 
different heaps V\k from their residua H j no matter what the heaps H j ar^^ 
a routine induction argument yields the proof. As a result, different games 
are never coalescing as welj^ 

In particular, every heap has the unique sequence vi of consecutive 
pull downs which produce it (H;^, that is) from the heap Hi 

Thus there is a 1 — 1 correspondence between all heaps V\k and sequences 
of valid applications of Hfc.unRemoveMax(ifc). In particular, the number of 


54 


®°RemoveMax is not a 1 — 1 function since the range of its restriction to iV-element heaps 
has a lesser cardinality than its domain. 

®^So that the graph of the game of Pull Downs is a tree. 

^^As long as all moves in the said sequences are valid. 

^^Thus heaps and games form trees of sequences whose property of never coalescing 
implies the uniqueness of the path (from the root) to any given no de. 

^"‘The sequence \/i^k-i can be constructed via the equality (79), where V\i k-i is the 
reversed sequence of consecutive residua of the heap V\k- 
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different heaps of size K is equal to the number of different sequences 
of 77 — 1 patched 

The above observations prove the following Theorem that will allow me 
to reduce the worst-case analysis of RemoveAll to analysis of some winning 
strategies for the game of Pull Downs. 

The Mapping Theorem 6.1. For every heap H of K > 2 nodes and its 
residue H of J < K nodes, there is a unique sequence vjx-i of K-J 
consecutive pull downs that produce H from H; in particular, there is a unique 
sequence yi^K-i of K — 1 consecutive pull downs that produce H from the 1- 
element heap. 

Proof follows from the above discussion. □ 

I will call the unique sequence of pull downs mentioned in Mapping The- 
a creative sequence of H relative to H and denote it by 
In the case of H bring the 1-element heap Hi, will I call it simply a creative 
sequence and use notation W'{H) in lieu By Mapping Theorem 

for any H, W'p{H) is a 1 — 1 function of H and, therefore, has the inverse 
I will denote it by or, in the case of 77 = Hi, by fF. 

Thus is the sequence v of pull downs that result in heap 77 when 

applied consecutively to heap 77, and A^(v) is the heap 77 created by appli¬ 
cation of the sequence v of consecutive pull downs to the heap 77. 

It follows from the above definitions that 77 is a residue of 77 if, and only 
if, W'{H) is an initial proper subsequence of A^{H) 


6.1 


oreni 


6.1 


7. Credits for moves 


I define credit cr{i, k) for a move k applied to a heap H on i nodes to be 
equal to the number CRenioveMax()(77') of comparisons of keys that application 
of operation RemoveMax() to a heap 77' produced by 77.unRemoveMax(7) will 
perform. By the RemoveMax Cost Lemma |4.4| page it is given by the 


equality (58) page 28 It is a function of k and the number i of nodes of 77. 


^^Recall that not every node can serve as the fcth patch Vk since its index i = Hj, 
must satisfy an instance of inequality ( [45| for n = k. 
sspormally, if y{ff) C y{H) and y^{H) ^ y^{H) 
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The equation (58) page 28 can be rewritten to a human-readable form as 
follows, substituting i for N and k for i. 

If k is an index of a leaf with a sibling then 


*X = 0 , 


(82) 


and, by (32), 


cr(i,/c) = 2 [IgfcJ. (83) 

One can verify by means of direct inspection that in this case the right-hand 
side of the equation (83) also yields the credit for pulling down the parent of 

(84) 


the node at index k, that is. 


k 


cr{i, L 2 J) = cr{i,k) = 2[lgfcJ. 
If /c > 1 is an index of a leaf with no sibling then then 


and, by (32), 


#;ij -1, 


cr(q k) = 2[lg/cJ — 1. 


(85) 


pulling down the parent of the node at index k in this case, so 


k 

cr{i, L 2 J) = cr{i,k) = 2[lg/cJ - 1. 


As before, the right-hand side of the equation (|85|) also yields the credit for 

( 86 ) 

(87) 

( 88 ) 


Also, 


cr(l, 1) = 0. 

Since dehned by 

cr'^°‘^(i) = max{cr(i, k) \ k < i}, 


is, by virtue of the dehnition of cr{i, k) at the beginning of this Section, equal 
to the maximum number C^Re^oveMaxO comparisons of keys that the 
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RemoveMax() may perform on any heap on i + 1 node^^ application of the 
equality (60) page yields 


= [IgiJ + [lg(i - 1)J. 


(89) 


The loss of credit X{i, k) relative to the said maximum for a move 

fc in a heap on i nodes is dehned as: 

X{i, k) = — cr{i, k). (90) 

If X{i,k) =0 then the move k and the pull down H[k] that yielded the credit 
cr{i,k) are called lossless; otherwise, they are called lossy. 

The following Lemma characterizes lossless moves. 


Credit Loss Characterization Lemma 7.1. Let H he a heap of N nodes 
and let k he a valid move (that is, one that satisfies the inequality (fS) 
page 24). Move k is lossless if, and only if, one or more of the following 
conditions are true: 

(i) k = , or 

(ii) k = or 

(hi) k is a sibling nodC^ in the last of H, or 


(iv) k is the parent of a sibling nod^ of the last level 
Proof First, I will prove the if part of the Lemma. 


61 


ofH. 


Ill 


Since k belongs to the last level of H, < k. Thus, by equality (83) 

cr{N,k) > 2[lg2L^sAfJj = 2[lgiVj > 

> cr™“^(iV). 

cr{N,k) = cr^^^N). 


page 
[by equality (89)] 
Thus, 


particular, the heap produced by a pull down applied to a heap on i nodes. 
^SFormally, 2[|J < N. 

59Formally, 208^1 < k. 

®°Formally, 2k < N. 

®iFormally, 2 ^^sN] < 2k. 
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Hence, by the equality (90), 


A(iV, k) = 0. 


IV, 


has a similar proof except that it begins with the equality (84) page 38 


rather than with (83). 


Since k = k belongs to the last level of H, therefore is a leaf i 


in 


that level. If k has a sibling then the case (iii) applies, which completes the 
proof o 
that is 


proof of this case. If k has no sibling then its parent [|J has one child only. 


h o [ig -VJ 

iV = 2[|j = 2[ ^ J = 2[2L^sAfJ-ij ^ 2 X 2^*8^^-^ = 


j\j- _ 2Ug'^J_ 


Thus 

[lg(]V - 1)J = - 1)J = 

[by equality (§ page|g 

= [lg(2i‘'S''J)J - 1 = [[IgAfJJ - 1 = [IgAfJ - 1, 

that is, 

[lg(Af-l)J = [lgAfJ-l. 


(91) 


By the equality (85) page |38[ 

cr{i,k) = 2[lg2Lig^Jj - 1 = 2[[lgiVjJ - 1 = 2[lgiVj - 1 = 
[by equality d^l 

= Ligivj + Lig(iv-i)j = 


[by equality (89) page [39 
Thus, 




cr{N,k) = cr^^^{N). 


Hence, by the equality (90), 


A(iV, k) = 0. 
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IV, 


follows from (i) by virtue of equality (84), in the case k has no sibling, 
or equality (84), otherwise. This completes the proof of the if part of the 
Lemma. 

If k is neither a leaf nor the parent of a leaf, which means that Ak < N, 

#" > 0. (92) 


then, by the equality (33) page 21 


By virtue of equality 


page 


21 


a-(iV,J;)=2Llg*:J-2 + #['sj+#« 


and 


cr{N, 2k) = 2[\g2k\ - 2 + #f,j + = 


lN I uN 
- [fcj +1) 

= 2( [Ig k\ + 1) — 2 + = 

= 2LlgfcJ+#^ + #^,. 


Thus 


cr(7V,2fc) = 2LlgA;J+#f+ 


lN 


Subtracting (93) from (94), we get 


cr(N, 2k) - cr(N, k) = 2 + #g- #[1^ > #". 


Hence, by (92), 


and, therefore. 


cr{N, 2k) > cr{N, k) 
X{N, k) > 0, 


(93) 


(94) 


thus making k a lossy move. 

If A; is a leaf but not in the last level of H then [IgfcJ = [IgA^J — 1, so, by 


virtue of equality (32) page 21 


cr(7V, k) = 2( [Ig iVj - 1) - 2 + ^ = 2 [Ig iVj - 4 + ^ < 


< 2[lgiVj - 2 < [IgiVj + [lg(iV - 1)J = cr^^^{N). 
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Thus 
and, therefore, 
thus making k a lossy move. 


cr{N, 2k) > cr{N, k) 
A(iV, k) > 0, 


If A; is a parent thereof then, by virtue of equality (84), in the case k has no 


sibling, or equality (84), otherwise. 


cr{N, k) = cr{N, 2k) 

and the same conclusion as for the previous case follows. 

Since there are no other cases, this completes the prof of the only if part, 
which completes the proof of the Lemma. □ 

One can immediately conclude from the above Credit Loss Characteriza¬ 
tion Lemma 7.1| (i) that for every i > 2, there is a heap H of i nodes and 
an index k such that pull down H[k] yields the maximum credit cr{i,k) = 
cr™’“^(i)ru turning the inequality (88) into equality for such i, if, and k, and. 


therefore, making that inequality tight for every i >2. 

Function A will allow me for easy evaluation of payoffs for some strategies. 


Let’s consider for example a sequence of iV -|- 1 alternating pull downs 
(1, 2,1,2,...) applied consecutively to a complete heap on N nodes. If iV = 1 
then both moves are valid and lossless. For N > 2, since nodes 1 and 2 are 
the smallest nodes in any heap on N nodes, all these moves are valid as well. 


that is, the inequality (45) page 24 is satisfied for every -|- 1 < n < 2N -|- 1 
and i = ii“^[l], ii“^[2]. It turns out that in the latter case, all these moves 
are also lossless, except, perhaps, for the second move that may lose 1 credit 
to cr^°'^{n), as one can conclude from the following Lemma. 

The pq Lemma 7.2. Let H be a complete heap on N = — 1 nodes, 

let p,q, with p < q < be its leaves, and let s = {si \ 1 < i < M), 


®^For instance, pull down 1 in a heap of i nodes that satisfies the equality i7[24s*J] = 1 
yields the maximal credit cr"^°‘^{i). 
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where 2 < M < N + 1, he a sequence of M pull downs of p and q, applied 
consecutively to H, that is given by this regular expression 


pq(pq + qp)*+ pq(pq + qp)*(p + q). (95) 


(i) All moves ofs, except, perhaps, for move S 2 , are lossless. 

(ii) If = q then move S 2 is lossless. 

(iii) If H[^] > q then move S 2 loses 1 credit relative to the maximum 
2 [Ig for that move. 


Proof. Let H = 

s applied to H. 


^h{s) 


63 


be the sequence of heaps produced by pull downs 


Move Si (the first move in the described sequence) pulls down p, which 
resides in the last level [IgiVj of iL = Hi as a leaf with a sibling, down to 
index + 1 in H 2 . By the Credit Loss Characterization Lemma 7. mnini 
move Si is lossless, with 

X{N,Si) = 0. (96) 


Move S 2 pulls down q, which either resides in H 2 at index i = —^ as the 
parent of p (that has been pulled down to index A^ + 1 by move Si) or as a 
leaf with a sibling. In the latter case, it scores the same credit as move Si 
did, that is, 

cr{N + 1,S2) = 2[lg A^J = 

[since N = — 1 and, therefore, [lg77j = [lg(A^ + 1)J — 1] 


Lig(iv + i)j + [igivj -1. 


Thus, 


cr(AT+ 1,82) = Llg(^ + l)J + [Ig^J - 1 - 


(97) 


This, by (89) and by inequality (88) on page 38[ yields one less than the 


maximum [lg(A^ + 1)J + [Ig A^J of credit cr{N +1, k) for any k < N + 1, thus 
making move S 2 lossy, with 


A(A^ + 1, S 2 ) — 1. 


This completes the proof of case 


111 


(98) 


®^Function was defined by (781 page 


35 
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at index i = ^ as the 


In the former case {q resides in H 2 
parent of its only child p), move S 2 pulls down q. By the Credit Loss Char¬ 
acterization Lemma 7.1 (ii), move $2 is lossless, with 


A(iV,si) = 0. 


This completes the proof of case (ii 


Moves S 2 j-i and S 2 j, where 2 < j < pull down p and g, which reside 
in level [lg(iV -|- 1)J of heaps H 2 j_i and H 2 j as leafs with siblings. By the 
Credit Loss Characterization Lemma 7.1 [(hi) each of those moves is lossless, 
with 

A(A^, S2j-i) = A(iV, S2j) = 0. 


This, together with (96), completes the proof of case[^ □ 

I will use special cases of sequences pq(pq + qp)*, namely, p = l,q = 2, 
in establishing some important constrains on optimal strategies, and later in 


Section 10, p = 1, g = 4, in design of an optimal strategy for the game of pull 
downs. 


8. Strategies and their payoffs 

A strategy s is an inhnite sequence (s„+i | n G tn) of valid moves con¬ 
secutively applied to the 1-element heap Hi. Given a strategy s, I will call 
the sum of the the credits, dehned in Section page 37, for all moves of s 
between n-th and m-th mov^^ inclusively, where 2 < n < m, the the payojf 
Ps{n,m). It is given by this formula: 


m 


Ps{n, m) = cr{i, Sj 


(99) 


The following lemma shows that the payoff for the game, dehned on 
page that the Player plays with a strategy s is equal to the payoff 
Ps(2, iV — 1) for the strategy s. 


®^Racall that ith move of any strategy produces an i-|-1-element heap out of an i-element 
heap 













M. A. Suchenek: A Complete Worst-Case Analysis of Heapsort (MS) 45 


The RemoveAll Cost Lemma 8.1. Let H be a heap of N > 2 nodes and s 
be its creative sequence. 


N-l 


c 


RemoveAll() 


(H) = X] 


( 100 ) 


i=2 


Proof by induction on N. For N = 2, both sides of (100) are 0, which 
observation completes the basis step. 


Let (100) be true for some N > 2, H he a heap of iV + 1 nodes, s be its 
creative sequence, H' be the heap produced by execution of if.RemoveAll(), 
and s' be the creative sequence for H'. By the above dehnition of if'. 




RemoveAll 


()(if) = G 


RemoveAll 


0 ) “1“ C^RemoveMax() (-^) * 


( 101 ) 


By the dehnition of creative sequence at the end of Section on page ^ 
Si,v = s'. Since if' has N nodes, by the inductive hypothesis we get 


N-l 


a 


RemoveAll() 


(if') = ^cr(i,sO. 


( 102 ) 


i=2 


Substituting (102) to (102), we obtain 

N-l 


GRemoveAllO ('^) E Crii.^ Sj) -|- GR0nioYeMax() (hi) 
i=2 

[by the dehnition of credit at the beginning of Section page 37 

N-l 


i=2 


N 

i=2 


which yields (100), thus completing the inductive step. 


□ 


If m = — 2 and n = — 1 then all moves between n-th and 

m-th move, inclusively, construct the level [IgmJ of the output heap. In such 
a case, both the argument of the n-th move and the result of the m-th move 
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of s are consecutive®^ complete heaps; I will call the payoff Ps{n,m), given 
by the equality (99), a level payoff and denote it by Pf''" ([Ig m \). Formally, 


2^+i-2 

P.'“(/<')= 5^ cr(i,s.). 


(103) 


i=2K-l 


Informally, it is the sum of credits of moves S 2 k_i 2 K+ 1-2 of s that added the 
next level K to the complete heap - 2 ) on 2^ — 1 nodes created 

by the hrst 2^ — 2 moves 2^-2 of s consecutively applied to the 1-element 
heap Hi. 


Similarly, I will call the sum of the the losses for all moves of s between 
n-th and m-th move, inclusively, where 2 < n < m, the the accumulated loss 
As{n,m). It is given by this formula: 


m 


As(n,m) = ^As(i), 

i=n 

(104) 

where 


As(0 = A(qSi), 

(105) 

with abbreviated notation 



As(m) = As(2, m) 


(106) 


that yields 

As(n, m) = As(m) — As(n — 1). (107) 

If As(n, m) = 0 then I call the sequences {ki \ n < i < m) of moves and 
(Hi[fcj] I n < i < m) of pull downs of s lossless; otherwise, I call those 
sequences lossy. 

The level loss at level 77 > 1 is given by this formula: 


A^e^(X) = A,(2^ - 1, 2^+^ - 2). 


( 108 ) 


If = 0 then I call the level K lossless; otherwise, I call it lossy. 


®^The difference between their depths is equal to 1. 
®®Function ^ has been defined on page 
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It follows directly from the definition (108) that for any natural number 


D: 


D 


= As(2, 2^+^ - 2) = As(2^+^ - 2). 


(109) 


K=1 


If M is the size of the largest complete heap of no more than m nodes 
and D is the depth of that heap then 

As(2, m — 1) = As(2, M — 1) + As(M, m — 1) = 


[by (109), taking into account that M = — 1] 


D 


^ A'"’'(i7) +As(M,m- 1) 


K=1 


[by virtue of equalities (24) and (25) page 16 


Llg(m+l)J-l 

= Y + As(2Li§(”^+^)J - 1, m - 1). 

K=l 

Thus, for any m > 3 

Llg(m+l)J-l 

A,(2,m-1)= Y + As(2L'g(”*+^)J - l,m-1). (110) 

K=1 

If, moreover, m = — 1 then [lg(m + l)J — 1 = [IgmJ and 2l-'s(”^+i)J —1 

= m, so that As(2l-^®(™+^)-l — l,m — 1) =0, and (110) reduces to 


[Ig m\ 

A.(2,m-1)= Ai”(A'). 

K=1 


( 111 ) 


If 77 is a residue of heap H then the payoff earned by the creative sequence 
^s(/f)0is given by: 


m—1 


^HH ~ "Yj 


( 112 ) 


®^Function ^ defined on page 37 
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where n and m are the sizes of heaps H and H respectively. If H is the 1- 
elenient heap then I will nse Ph as an abbreviation of Ppjj- By the eqnality 
(100) in the RemoveAll Cost Lemma 8.1|page 45, 


Phh = C. 


RemoveAll0 


(H) - Q 


RemoveAll() 




(113) 


where CRemoveAii()(-^) is fhe nnmber of comparisons of keys performed by the 
execntion of F.RemoveAll() on heap F. Thus, 


Phh = Ps{n,m-1), (114) 

where s is any strategy whose n — 1st move produces heap H and m — 1-st 
move produces heap H. 

Under the same assumptions as above, the accumulated loss (of credits) 
for the creative sequence is given by: 

m—1 

^HH ~ ’^H{H)i)- (i-i-5) 


If H is the 1-element heap then I will use Ah as an abbreviation of Ap^^. on 
heap F. Thus, by (104), it satisfies this equality: 


^HH = As(n,m - 1), 


(116) 


where s is any strategy whose n — 1st move produces heap H and m — 1-st 
move produces heap H. 


I will call the payoff P^^{n, m) yielded by the sum of credits that match 
the maximum cr'^°'^{i) given by (89) page 39 for all moves of between n and 
m, inclusively, where 2 < n < m, the upper-hound payoff. It is given by this 
formula: 


i=n 



([ig?J + [ig(i - i)J). 


69 


(117) 


39 


®®By vitrue of (891 page 
®®It is known that for n = 2, the right-hand side of (119) reduces to a closed-form 
formula (2m — 1) [lg(m — 1)J — 20s(™“^)J+2 _|_ instance, proof of Theorem 8.1 in [10] 
provides a derivation of it. Thus for any n > 2, the right-hand side of (117) reduces to a 
closed-form formula (2m -h UUgmJ - 2 ^^sm ]+2 _ _ l)[lg(„ - 1)J + 
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For any heap H on m nodes and its residue H on n < m nodes we have: 

(n,m — 1) — = 


[by (117) and (115)] 


m—1 




[by (90)] 




m—1 


m—1 


= > cr™“^('A 


Y^{cr^-^{^)-cr{^,yp{H),)) = cr(*, 


[by ([^] 
Thus 


— P- 


PHH = P''''{n,m-l)-A 


HH- 


( 118 ) 


As a special case of (117), we have: 


m—1 


m—1 


P^^(2,m-l) = ^(Llg*J + Llg(^-l)J) = 2 5^Llg*J - Llg(”^-l)Ji (119) 


i=2 


i=2 


which, by virtue of equality (20) in [10], yields: 


Thus P^^(2,m — 1) is and upper bound for C'^emoveAii()(^)- 


( 120 ) 


The dehnitions (90), (99), (104) and (117), yield for every 2 < n < m: 
Ps{n, m) + As(n, m) = P^^{n, m). 


( 121 ) 


In particular, by virtue of (89) and the inequality (88) page 38, we have 
for every 2 < n < m: 


Ps(n,m) < P^^{n,m). 


(122) 
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Equation (121) will allow me for easy derivation of the value of payoff 
Ps{n,m) from its upper bound given by a known closed-form 

formula, once I have computed the loss of credit As (n,m). 

Thus the Player draws at random an integer N > 2, choses any strategy 
w, and plays the first — 1 moves of it. By the definition of Player’s payoff 
his total payoff is equal to the number of comparisons that 


34 


on page 

HAr.RemoveAll() will perform while run on the heap Hat = ^(wi_Ar_i) 
produced by the N — 1st move of the strategy w. This, by virtue of the 


definition (99) page 44 of Pw{N — 1) and the RemoveAll Cost Lemma 8.1 
page 1^ is eqnal to 


Af-l 

P^{N - 1) = Pw(2, - 1) = ^ cr{i, Wi). (123) 

i=2 

His total loss of credit, defined as 

7V-1 

Aw(iV-l) = A,„(2,iV-l) = (124) 

i=2 


determines whether the strategy w he choose is optimal, according to the 
following Lemma. 


Lemma 8.2. A strategy s is optimal for N > 2 if, and only if, for every 
strategy u, 

N-l N-l 

5) K{i) < Y. A„(i). (125) 

i=‘2 i=2 


Proof. s is optimal for A^ > 2 if, and only if, for every strategy u, 

N-l N-l 

^ cr{i,Si) > ^ cr{i,Ui), 

i=2 1=2 


or 


N-l 


N-l 


^(cr™“"’(i) - cr{i,Si)) > ^(cr”"“"’(z) - cr{i,u.f)), 


i=2 


i=2 


or, by (pb|) on page 39, (125) holds. 


□ 


70 


The function k/' has been defined on page 


37 














M. A. Suchenek: A Complete Worst-Case Analysis of Heapsort (MS) 51 



Figure 10: A draft of the definition of optimal strategy from an earlier manuscript. 

The Player wins if the payoff Pw{N — 1) for the strategy w he choose is 
optimal, that is, cannot be beaten by any strategy for the same N. 


Theorem 8.3. A strategy w is optimal for N > 2 if, and only if, its first 
N — 1 moves produce a heap Hjv = l^(si^jv-i) ^ that forces HAr.RemoveAll() 
to perform the worst-case number of comparisons of keys. 


Proof. Suppose that w is an optimal strategy for N but the heap Hjv is 
not a worst-case heap for RemoveAll. In such a case. Player B can pick 
up an Af-element worst-case heap Gn for RemoveAll, run GAr.RemoveAll() 
on it while recording a sequence \ 1 < i < N) of patches that were 
used to £11 the vacancies left by the removed maximal elements, and then 
play any strategy v whose first N moves are given by the reversed sequence 
{kn-i I 1 < f < N). Since the payoff for the Player A, by the definition 
(32) of cr{i, k) page 32, is equal to the number of comparisons performed by 
HAr.RemoveAll(), and the payoff for the Player B is, equal to the number of 
comparisons performed by GAr.RemoveAll(), the payoff for the Player B is 
larger than the payoff of the Player A, and so the Player B wins, contrary 
to the assumption that the Player A w had an optimal strategy for N. □ 


The player could try to apply a brute force and devise a greedy strategy 
that maximizes credits for its all moves. This could be done relatively easily 
with a program that attempts to generate a greedy substrategy for any given 
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size of a worst-case heap. Unfortunately, as I illustrate in Appendix B 
page |112[ any such greedy substrategy must fail to produce any worst-case 
heap of more than 22 nodes. 

In the next Section, I will use a bit subtler approach to generation of 
worst cases for RemoveAll. 


9. Strategies par and win, and the lower bounds they establish 



Figure 11: A worst-case heap Hy of 7 nodes for RemoveAll. 


Given an integer Af > 2, I will construct an optimal strategy win(Af) for 
that N. For N < 7, one can take any worst-case heap H of 7 node^^ (for 
example, the heap [7, 6, 3,4, 5, 2, 1] visualized on Figure [IT| and extract its 
creative sequence {H) of the hrst 6 pull downs (for example, the creative se¬ 
quence (1,1,1,1, 2,1) for the heap [7,6, 3,4, 5,2,1], given by the hrst 6 moves 


(1, 2, 3,4,4, 5) of the sequence shown in Appendix A.l page 105) of a strat¬ 
egy from it; we will see in a moment that (e.g., (1,1,1,1, 2, l)'^s), 

where s is an inhnit^^ sequence of pull downs, for instance, (1,1,1,...), is 
an optimal strategy for every 2 < N < 7. Some straightforward experi¬ 
mentations with different sequences of pull downs allow one to hnd optimal 


^^Beginning with a worst-case heap on 8 nodes and using heaps of N = 20®^! nodes as 
benchmarks, instead, would be, perhaps, more rational but it would also make the their 
illustration on Figures 


72 


11 


and 12 less neat. 


Just to make the entire sequence infinite. 
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strategies for < 12 (for example, ( 1 , 1 , 1 , 1 , 2 , 1 , 1 ,4, 1 ,4, l)'^s is such an 
optimal strategy the hrst 11 moves of which produce a 12 -node worst-case 
heap visualized on Figures and pages W and ^ see Appendix A.l 
page 105 for details of construction of such a heap). 


For A^ > 12, such a construction of is easy if the sought-after worst- 
case heap is complete, that is, if A^ = 2 ^^^^^^ — 1 ; the mentioned above 
sequence A^{H) followed by any inhnite sequence of pull downs that makes 
their concatenation s’s level loss = 1 for every K > 3 (for instance, 

( 1 , 1 , 1 , 1 , 2 , 1 ) followed by the inhnite sequence ( 1 , 2 , 1 , 2 ,..., 1 , 2 ,...) of al¬ 
ternating pull downs of 1 and 2) will do. However, showing that such a 
construction results in an optimal strategy for any N = 2^^^— 1 is nothing 
but a routine exercise due to the number of cases to consider in order to 



One can easily verify (a program or an argument will do) that application 
of operation H 7 .RemoveAll() to H 7 performs 14 comparisons of keys, which 
number happens to be equal to the knownj^ upper bound 


7-1 




RemoveAllO 


(7)=2^Llg"J-llg(7-l)J 


i=2 


^^See |S] for an example of such a proof. 

this a combination of the strategies (1,2,1,2,..., 1,2)'^s and 
(l,l,l,l,2,l,l,4,l,4,l)-s 
^^At least the one I know. 

^®See, for instance, m for a proof of the formula for any heap on A^ > 2 (not just on 
N = 7) nodes. 
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Figure 12: The invariant fragment of the heap on m = 24s4+i _ i nodes, where i> 7, 
for the strategy par, with locations of nodes 1, 2, 3, and 4 shown. 


on the number of comparisons of keys that the RemoveAll performs on any 
heap of 7 nodes. Thus 14 is the worst-case number of comparisons of keys 
that the RemoveAll performs on any heap of 7 nodes, and, therefore, H 7 is a 
worst-case heap for RemoveAll. 

The heaps Hg through Hi of H are defined as the residua of successive 
applications of RemoveMax() to Hy. The creative sequences of those residua 
are the beginning subsequences of the creative sequence for Hy. Because 

7 

'^.“ov..ii()(7) = (126) 

i=3 

where G'n^oveMaxO (*) maximum number of comparisons of keys that 

the RemoveMax performs on any heap of i nodes, those creative sequences are 
lossless. As a result all the residua of Hy, that is, the heaps Hg through Hi, are 
automatically worst-case heaps for RemoveMax and, therefore, for RemoveAll. 

After the 7th move, the strategy par is defined inductively level-by-level. 
For any I with. 


7 < / = - 1 = 2 ^®^ - 1, (127) 

it takes the heap Hy on I nodes, whose scheme is visualized on Figure 12 


(for instance, heap Hy of Figure 11 falls under that scheme), that it has 


77 


Since I > 1. 
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650 

651 S 

652 

653 

654 

655 

656 

657 

658 

659 

660 
661 
662 

663 

664 

665 

666 

667 

668 

669 

670 

671 

672 

673 

674 L 


// 


// 

// 


public static int[] Par (int L) 

{ 

int N = MyMath.twoTod)-1; // the number of nodes 
First, level 0, 1, 2 

int[] s7 = {0, 1, 2, 3, 4, 4, 5}; // a winning strategy for N == 7 
int[] strategy = new int[N]; 
for (int i = 1; i < Math.min(N, 7); i++) 
strategy[i] = s7[i]; 
if (N <= 7) return strategy; 

More than 3 levels 

Constructs strategy Par level-by-level from level 3 to L-1 
for (int lev = 3; lev < L; lev-^■^) 

{ 

int I = MyMath.twoTo(lev)-1; // first move in the current level 
strategy[I] =1; // 1 pulled down 
strategy[I-^l] = I-l; '! 2 pulled down 

strategy[I-H2] = I■^l; // 1 pulled down - 4 is at the begining of thi 

strategy[I-i-3] = I■^2; // 2 pulled down 

strategy[I-^4] = 1+4; 11 2 pulled down 

strategy[I-H5] = I-^3; // 1 pulled down 

for (int i = I■^6; i <= 2*1; i-n-) 
strategy[i] = i-l; 

} 

return strategy; 


Figure 13: A Java code that implements strategy par. Method Par(L) returns the first L 
levels of moves of par. The static method twoTo(n) in class MyMath computes 2”. The 
sequence of moves (1, 2,3,4; 4j 5) in line 454 produces the heap Hy of Figure 11 created 


by the sequence (1, !> !> !> 2,1) of PuU downs; it comes from the equality (811 page 36 


Instruction in line 663 is a bit wasteful and could be replaced with I = lev; followed by 
I = 2 * I + 1; before the line 672, but it comes handy as is while evaluating the outcomes 
of method Par(L). 


constructed so far, and adds to it the next level lev by performing on H/ 
a sequence par^ 2 / of -f + 1 consecutive moves such that the resulting heap 
H 2/+1 on 2/ +1 nodes falls under the scheme of Figure [T^ and all these moves 
are lossless except for one move pary_,_^ (the second move in the described 
sequence) that scores a credit that is one less than the maximal score. 


More specihcally, move par^ (the hrst move in the described sequence), 
implemented by the statement at line 664 in the method Par(L) shown on 
Figure pulls down 1 that resides at index J of heap H/. That move, 
among other effects, brings node 3 to the last index / in the level [Ig/J of 
H /+1 and makes node 1 the child of node 4 at index Nodes 2 and 4 
maintain their indicies J — 1 and respectively, in heap H/+i. 


Move pary^_]^, implemented by the statement at line 665 in the method 
Par(L) shown on Figure 13, pulls down 2 that resides at index / — 1 of heap 
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H 7 + 1 . Nodes 3 and 4 are not affected becanse they reside at the same level 
[Ig/J as 2 did in Hj+i. 

Move parj_,_ 2 , implemented by the statement at line 666 in the method 
Par(L) shown on Fignre 13, pnlls down 1 that resides at index / + 1 of heap 
H/+ 2 - That move, among other effects, brings its parent 4 to the first index 
/ + 1 of the last level [Ig/J + 1 of of H/_|_ 3 . Nodes 2 and 3 are not affected. 

After that, the remainder moves (par^ | 1 + 3 < n < 21), implemented by 
the statements at lines 667 throngh 672 in the method Par(L) shown on Fig¬ 
nre keep pnlling down 1 and 2 that reside as leaves in heaps H/+ 3 ,..., H 27 , 
making snre that they always pnll down a node with a sibling, and that the 
two last moves are pnlling down 2 and then 1. Nodes 3 and 4 are not affected 
by any of those moves, except that node 3 becomes the parent of node 2 in 
H 2 / and the parent of nodes 1 and 2 in H 2 /+ 1 . 

Obvionsly, all the moves are valid becanse no other node of any heap is 
smaller than 2 and 1 . 


The resulting heap H 2/+1 falls under the scheme of Figure 12; the first 


node H 27 +i[/ -P 1] in the last level of H 2/+1 is 4, the last two nodes H 2 /+i[ 2 /] 
and H 2 /+i[ 2 / -P 1 ] in the last level of H 27-+1 are 2 and 1 in that order, and the 
parent H 2 /+i[/] of 1 and 2 in H 2/+1 is 3. Moreover, by the pq Lemma 7.2 
and 


111 


_ page 42 substituting 1 for p and 2 for q, taking into account that 

H/[^] = 4 > 2, all the moves were lossless, except for the I + l = 2 l-^®T+i)Jth 


move parj_,_]^ (the second move S 2 in the sequence of the pq Lemma 7.2) that 


scored a credit one less than the maximal credit for the / -P 1st move. 
This way we proved the following two theorems. 


Theorem 9.1. Every heap H on N = — 1 nodes, where N > 7, 


produced by strategy par satisfies the invariant visualized on Figure 12 


Proof follows from the above discussion. 


□ 


Theorem 9.2. For every i>l, 


Apar(0 = 


hf S<i = 2Ligd 
0 otherwise. 


(128) 
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Proof. If i < 7 then Apart* 


= 0 because the first 6 moves of strategy par are 
lossless. If * > 7 then, by ( |l26 ), Apar(*) = 0 for all i except / + 1 = 
in the above discussion, that is, i = 2 l-*§b^ for which Apar(*) = 1- □ 


Figure 14 shows a graph of Apar(*) 



Figure 14: A discrete graph of function Apar(z); the characteristic function of the set of 
powers of 2 that are greater or equal 8. 


A Java code that implements strategy par is shown on Figure and 

an example of a heap produced by strategy par is shown on Figure The 

subsequence par^ ^4 of the first 14 moves of par that produced it is: 

(1,2,3,4,4,5,7,4,8,9,10,12,12,13), 

and the sequence (H„[par^] | 1 < n < 14) of corresponding pull downs is: 

(1,1,1,1,2,1,1,2,1,2,2,1,2,1). 


The sequence Hpu = (^Hi(pa*’i,i 4 ) of 14 heaps created by these moves is 
shown in the [Appendix A~ page 105 


Let for every * > 1, the delayed loss function A* be defined by: 

1 if i = 2 rigd _4 


AT*) = 


(129) 


0 otherwise. 


I will show in Section 10 that function establishes a lower 

bound on accumulated loss of credit for moves 1 through n for any strategy. 
The following Lemma provides a closed-form formula for that lower bound. 


Lemma 9.3. For every n>l, 

n 

^ A* (*) = max{ [lg(n + 4)J, 3} - 3. 

i=2 


( 130 ) 
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RemoveAllRec: 15-element heap deconstructed in 58 comps. 
RemoveAllRec: Theoretic worst-case number of comps = 58 
RemoveAllRec: Number of comps for heap built with Par = 58 
»> Worst-case: 

15-element heap level-by-level: 

Level 0: 15 

Level 1: 

Level 2: i 

Level 3: 4^^''6 10^^''5 7''^^8 2^^^1 

Heapsort: array sorted in 80 comps 

Heapsort: Theoretic worst-case number of comps = 80 

The strategy was the first 14 pull downs of par 


<1, 1, 1, 

1, 

2, 

1, 

1, 

2, 1, 

2, 

2, 

1, 2, 1> 

or moves: 
<1, 2, 3, 

4, 

4, 

5, 

7, 

6, 8, 

9, 

11 

10, 12, 13> 


BUILD SUCCESSFUL (total time: 0 seconds) 


Figure 15: A complete heap H 15 produced by the first 14 pull downs of the strategy par 
(an output of my Java program). The pull downs and the moves are shown at the bottom 
of the output. 


Proof. By virtue of definition (129), -^*(0 equal to the number of t’s 

that are less than or equal n, with 12 < i — 4, or 16 < z + 4 = . 

Since 2"^ is a 1 — 1 function, that number is the same as the number of different 


values of [Igz] between 4 and lg(u + 4), which is equal to [lg(n + 4)J — 3 for 
[lg(n + 4)J > 3 (same as n > 4) or 0 otherwise. 


Hence, 


A*(z) = max{ [lg(n + 4)J — 3, 0} = max{ [lg(n + 4)J, 3} — 3, 

i=2 


which yields (130). 


□ 


Graphs of functions X*{N) and visualized on Figure 


16 


Lemma 9.4. For every K > 1, the following equality holds: 

2K+i_2 

A‘p'”‘w= 

i=2^-l 


(131) 
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Figure 16: Discrete graphs of the delayed loss function X*{N) (upper graph) and its sum 


A*(i) (lower graph), for 2 < iV < 70. 


Proof. The least solution of the equation 

i = 2rig*i _ 4 


(132) 


that occurs in (129) isi = 12, so X*{i) = 0 for i < 7. Thus ~ ^ 

for K = 1,2. Since, by (128), Apar(«) = 0 for i < 7, also Ap™®^(77) = by the 


equation (108) page 46 •^pa''(*) = 0 for 77 = 1,2. Hence (131) holds 

for K = l^T 


For each K > 3, the equation (132) has exactly one solution that satishes 


2^ — 1 < z < — 2, namely, i = 2^^^ — 4, and so does the equation 

i = 2l-'sd that occurs in (128), namely, i = 2^. Thus both Apar(*), 


that by (108) is equal to and equal 1. Hence, 

by (108),' the' equation (131) holds for each K > 3. □ 


Lemma 9.5. For every D > 1, the following equality holds: 

2D+i_2 

Ap„(2'’+‘- 2) = A*(*). 

i=2 


Proof. By the equality (109) page 47 


D 


Ap.,(2'’+‘ - 2) = ^ A'“"(A') 


K=1 


[by the equation (131) in Lemma 9.4 


D 2 ^+ 1-2 2 °+ 1-2 

E E E 

K=li=2K-l i=l 


(133) 
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[since A*(l) = 0] 


Hence, (133) holds. 


4 

3 

2 

1 


10 


20 


2^+i_2 


i=2 


30 40 50 


60 70 


□ 


Figure 17: A discrete graph of function Apar(fV — 1 ) = Apar(*), where Apar( 7 ) is the 

function visualized on Figure 14 page 57 Clearly, for N > 5, Apar(A^ — 1) = [Ig A7] — 3 = 

Llg(iv - 1)J - 2. 


Figure 17 visualizes graph of function Apar(iV — 1). 

Not surprisingly, it coincides with function A*^^{N — 1) for some N, as 
the following Lemma states. 

Lemma 9.6. For every N > — 4, the following equality holds: 

N-l 

Ap.,(iV-l) = EvW 


i=2 


78 


(134) 


Proof. If Af = then the equation (134) follows from (134) 


since 


Apar(2L^^^J - 1) = 0 = X*{2\-^SN\ _ 
If < N and - 4 < iV then 

Apa.(2Lig^J,iV-l) = l, 

and 

Af-l 

E v(i) = i. 


(135) 

(136) 


j=2Lif5JVJ 


^®One can also show, using the fa ct th at par loses to another another strategy for every 
N < — 4, that the equation (134) holds only if A^ > 2 r^sWl _ 4 ^ j do not need 

that fact in the proofs presented in this paper. 
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Thus 


[by (135)] 


Ap3.(iV - 1) = - 1) + Ap3.(2Lis'^J, iV - 1) = 

= Ap3,(2L'g^J - 1) + 1 = 


[by (136) and the already proven case of (134) for N = 

2LigJVJ-i jY-1 N-l 

= E E a*(*) = 5^a*(z). 

i=2 i=2LigJVJ i=2 


Hence, (134) holds. 


□ 


The about facts about strategy par is all I need to derive and prove the 
main results of this paper. Below is its more definitive characteristics that I 
quote here for an illustration. 


The payoff Ppar{N — 1) after A^ — 1 moves of the strategy par for N >2 
is given by this formula: 


2{N - 1) [lg{N - 1)J - + min( [lg{N - 1)J, 2) + 4. (137) 



Figure 18: A graph of function Ppar(tV — 1) of payoff for strategy par. Also, a lower bound 
on the number of comparisons of keys performed in the worst-case by RemoveAll on an 
iV-element heap. 
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Figure 18 shows a graph of the total payoff function Ppar for par. Of 


course, Ppar, 


given by (137), is a lower bound for the number of comparisons 


of keys that RemoveAll performs in the worst case on an A^-element heap. 


586 

587 B 

588 

589 

590 

591 

592 

593 

594 

595 

596 

597 

598 

599 

600 
601 
602 

603 

604 

605 

606 
607 L 


// 

// 

// 


public static int[] Win (int N) 

{ 

int[] strategy = new int[N]; 

int[] s = ParCMyMath.floorLg(N)+l)i // arg. is the number of levels 
for (int i = 1; i <= N-1; i++) 
strategy[i] = s[i]; 

int I = MyMath.tivoToFZoor/.g(N+l)-1; // last index in the last full level 
if ((N == I)||(N<=7)) return strategy; 

Incomplete heap of size > 7 

Leave moves of Par for all full levels 

Use greedy for the last (incomplete) level begining with I 
strategy[I] = I; // 1 pulled down (same as in Par) 
if (N == I+l) return strategy; 
strategy[I+l] = (I+l)/2; 4 pulled down 

if (N == 1+2) return strategy; 
for (int i = 1+2; i <= N-1; i++) 

if (i == 2*1-2) II can't pull 4 down 
strategy[i] = i; // 1 pulled down 

else 

strategy[i] = i - 1; // alternating pull downs 
return strategy; 


Figure 19: A Java code that implements strategy win(iV). Method Win(N) returns the 
first iV — 1 moves of win(P). The static method floorLg(N) in class MyMath computes 
[IgPJ. The static method twoToFloorLg(N + 1) in class MyMath computes 2L's(-^+i)J. 
Computation of the moves in the last level (lines 597 through 605) overrides the respective 
moves of the strategy par (returned by the call to method Par in line 589 and copied to 
array strategy in for-loop in lines 590 and 591) in order to simplify handling of the 
special case of the method Win(N) for N < 7. 


The strategy par loses 1 credit per level, beginning with level 3, and in 
the very second move of each such level. Thus switching to a greedy strategy 
in the last level of the produced heap (moves through N — 1, provided 

that N > 21-^^'^J) may help one to avoid the loss of credit in the last level and, 
therefore, result in a set of strategies that are better than par for inhnitely 
many N. 

It turns out that, given iV > 8, a strategy win(Af) that is a combination 
par + gre of the strategy par, played for all levels from 0 to [lg{N + 1)J — 1, 
and a greedjj^ strategy gre for the level [lg{N + 1)J if N > — 1, 


^®One that does not admit any delayed gratification, that is, takes a loss only if no 
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is an optimal strategy for N, that is, it produces a worst-case heap in its 
first N — 1 moves. I am going to show that for every N > 9, the above 
arrangement allows win(A^) to postpone the loss at the move that the 

strategy par would have to incur, up until the move — 4 of win(A^) if 

2 rig''vi _ 4 ^ Qj, avoiding that loss altogether, otherwise. In the latter 
case, for all N < 21^^®^^ — 4, strategy win(iV) will beat par by 1 credit after 
completion of their first — 1 moves. This fact and the formula (137) for 


Ppar{N—l) will yield the formula for payoff Pwin(Af)(-^—1) for strategy win(iV). 


I define the greedy strategy gre that in the case of 

N > 2L'g(^+i)J - 1 > 7 


(138) 


is played in the last level [IgiVj = [by ( 138| )] [lg(iV -|- 1)J of the constructed 
heap as follows: 

(i) It pulls down 1 (the last element of the heap of 2l-*s(^+^h — 1 nodes that 
is shown on Figure [I^. 

(ii) It pulls down 4 (the new parent of 1). 

(hi) It repeats steps and (ii) (in that order) until (ii) is invalid. 

(iv) It pulls down 1. 

(v) It pulls down 1 and 2. 

(vi) After that, it keeps executing pull downs of 1 indefinitely (just to make 
win(A^) an infinite sequence) 

More specifically, gre is defined inductively level-by-level, from level 3 up. 


For any N that satisfies inequality (138), it takes the largest complete heap 
H of no more than N nodes, which by (|24[), happens to have 


j ^ - 1 


(139) 


nodes, that was created by the first / — 1 moves of par and, therefore, by 
virtue of Theorem 9d_, falls under the scheme visualized on Figure \C2\ and 
performs on it a sequence win(A^)/ jv-i of A^ —/ consecutive moves as follows. 

The move win(A^)/ (the first move gre^ in the described sequence), im¬ 
plemented by the statement at line 597, with I computed by the statement 


at line 592, in the method Win(N) shown on Figure 19, pulls down 1. That 


lossless move is valid at the moment. 
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move, among other effects, brings node 3 to the last position i in the level 
[IgzJ of Hj+i and makes 1 the only child of 4. 

The move win(iV) 7 _|_i (or gre 2 ), implemented by the statement at line 599 


in the method Win(N) shown on Figure 19, pulls down 4. 

After that, the moves (win(iV)„ \ I + 2 < n < min{2J — 3,N — 1}), 
implemented by the first part of the f or-loop where i < 21—2 (the statements 
at lines 601 through 606) in the method Win(N) shown on Figure 19, keep 


pulling down 1 and 4 that reside as leaves in heaps H/+ 2 ,..., Hinin{ 27 _ 3 ,Af-i}, 
making sure that they always pull down a node with a sibling. All those are 
valid moves since 1 is the least node in any heap and the only two nodes 2 
and 3 that are less than 4 reside at the last two indicies / — 1 and I of level 
[Ig/J in heaps H7+2, Hmin{ 27 - 3 , 7 V- 1 }- 

An example of situation after move win(A^) 2/-3 for N = 28 and / = 15 is 
shown on Figure page 67 

If Win(N) is not done at this point, that is, if 2/ — 2 < A^, then the move 
win(77)27_2, implemented by the statement at line 603 in the method Win(N) 
shown on Figure [T^ pulls down 1 , which resides in H 27-1 as the parent of 1 
that is a leaf at index 21 — 2 and has no sibling. So, it scores the same credit 
as move win(A^)/+i that pulled down 4 for the first time did, that is, 

cr{21 - 2 , win(iV) 27 _ 2 ) = 2 [lg(/ + 1 )J - 1 = 


[by virtue of (127)] 


Llg(2/-2)J + Llg(2J-3)J-l. 


This, by (89) and the inequality (88) on page |38| yields one less than the 
maximum of credit cr{21 — 2,k) for any k < 21 — 2, thus making move 
win( 77 ) 27-2 lossy, with 

Awin(7V)(2/-2) = 1. (140) 

After that, if Win(N) is still not done, that is, if 2/ — 1 < A^, then the 
reminder move win(A^) 27 -i, implemented by the statement at line 605 in the 


method Win(N) shown on Figure 19, pulls down 2 that resides as a leaf with 
siblings in heap H 27 _i. By the Credit Loss Characterization Lemma 7.1 


111 


page 39, move win(A^) 27 _i is lossless, with 


A(iV,si) = 0. 
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At this point, Win(N), that computes only the hrst N — 1 moves of win(iV), 
terminates because 21 — 2 > N, so the next move after the move 2/ —3 would 
fall beyond that range. 

Obviously, the last two moves are valid because no other node of any heap 
is smaller than 1 and 2. Moreover, by The pq Lemma 7.2 (ii)|and (hi) page 


42 


7.2 (ii)|and 

(hi) 

that 

-1 = 


the moves up to and including the move win(iV)min{ 2 r- 3 ,Af-i} were lossless. 
Thus, since 2/ — 3 > — 1, all the moves between win(A^)/ and win(iV) 7 v_i 

were lossless, except for move win(A^) 2 r-i that scored a credit one less than 
the maximal credit for 21 — 1st move. 

This way I proved the following theorem. 

Theorem 9.7. For every i < N, 

' Apar(f) if N <7 or N = 2^18^1 - 1 or i < 2 Lis(^+i)J _ i 

Awin(7V)(*) = { I if 8 <N and 2L'g(^+bJ - i < i = _ 4 

0 otherwise. 

(141) 


10 


N 



14 

30 

62 


Figure 20: Discrete graphs of functions Awin(Ar) (*) for N = 14, 30, and 62 and 1 < i 1. 


Proof Follows from the above discussion. In particular, the equality (141) 


can be easily extracted from the code of Java program shown on Figure pT 
page 


□ 


Since Awin(i 2 )(^) = 0 for i < 11, win(12) is an optimal strategy for 11. 
Thus the heaps H 2 ,...,Hi 2 created by the hrst 11 moves of win(12) are all 
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lossless and, therefore, worst-case heaps. Hence, all worst-case heaps of 12 
nodes or less are lossless, too. 


I am going to prove in Section 10 that for every N > 2 (not jnst for 
N = 11), win(A^) is optimal for — 1. 

Lemma 9.8. Let for every i > 1, the delayed loss function X* be defined by 


the equation (129) in Lemma 9.4 page 58. For every N > 3, the following 
equality holds: 


N-l 


Awi„(K)(A'-l) = 5 ^A'W. 


(142) 


1=2 


Proof. By (129) page 57, for —l<i<iV — 1, X*{i) = 1 if, 

and only if, i = 2^^*^*^ — 4, the latter of which, by (141) page 65, holds for 


N-l 


2 Ug(^+i)J —l<i<iV — lif, and only if, A(i)win(v)(*) = 1- Thns, 

X*{i) = 0 = X{i)w]n{N)ii), 

so that 

i=2L*s{Ar+l)J _i 

Therefore, since 

Awin(v)((V-l) = Awin(7V)(2L'®('^+AJ _i^iV-l), 


in order to prove (142) it snffices to show that 


2Lig(JV+i)J _2 

Awin(K)(2^“<"+'>J-2)= Y1 


(143) 


i=2 


Since by (141) page 65, for all i < — 2, Apar(i) = A(i)win(Af)(*), 

Awin(7V)(2L's(^+AJ _ 2) = Ap3,(2L'g(^+i)J - 2) = 


[by the eqnality (133) in Lemma 9.5 page 59, snbstitnting [lg(A^ -|- 1)J — 1 
for D] 

2 Lig(Ai+i)J _2 


i=2 
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This completes the proof. □ 

A Java code that implements the strategy win(A^) is shown on Figure [I^ 
A worst-case heap on 28 nodes produced by the strategy win(28) and the 
sequence of the first 27 pull downs of win(28), which is the same as the 
sequence of the first 27 pull downs of win(31), is shown on Figure 21 The 


sequence of 14 heaps created by the last 14 of these moves is shown in the 
Appendix A.2 page 107 


RemoveAllRec: 28-elenient heap deconstructed in 159 comps. 

RemoveAULRec: Theoretic worst-case number of comps = 159 
RemoveAilRec: Number of comps for heap built with Par = 158 
>» Worst-case: 

28-element heap level-by-level: 

Level 0: 28 

Level 2: .-^23 4 

Level 3: 16^^^^""'-18 20^""^''''^22 i5-v-v^^^^25 

Level 4: 9''^12 6^^17 10^''! 1 5^''21 7^''13 8^''^4 l^'' 

Heapsort: array sorted in 207 comps 

Heapsort: Theoretic worst-case number of comps = 207 

The strategy was the first 27 pull downs of win(28): 

<1, 1, 1, 1, 2, 1, 1, 2, 1, 2, 2, 1, 2, 1, 1, 4, 1, 4, 1, 4, 1, 4, 1, 4, 1, 4, 1> 
BUILD SUCCESSFUL (total time: 0 seconds) 


Figure 21: The heap H 2 S produced by the first 27 pull downs of the strategy win(28) (an 
outpnt of my Java program). The first 27 pull downs are at the bottom of the output. 
All pull downs in the last level (the last 13 pull downs) are lossless; however, there is no 
lossless pnll down that the heap H 2 S would admit. 


Now, we are ready to compute the accumulated loss for each strategy 
win(Af). 

Lemma 9.9. For every N >2, 


Awin(iv) {N -l) = max{ [lg(iV -b 3)J, 3} - 3. 

A 


(144) 


Proof. By the equality (142) in Lemma 9.8 


Application of Lemma 


9.3 


page 


page 


57 


66 


(AT) (77 - 1) = 
completes the proof. □ 


A graph of function Av„jn( 7 v)(A^ — 1) is shown on Figure 


22 
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30 
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60 


Figure 22: A discrete graph of function Awin(Ar)(-/V-l) = Kin{N){i), where Xw\niN){i) 

is the function visualized on Figure [Id] page 57 Clearly, for N > 5, A„„jn(M)(A^ — 1) = 

Llg(iv + 3)J - 3. 


The Lower Bound Theorem 9.10. The payoff P^\„{^is[){,N — 1) after N— 1 
moves of the strategy win(iV) for N > 2 is given by this formula: 

2{N - 1) [lg(iV - 1)J - + min( [lg{N - 1)J, 2) + 4 + c, (145) 


where c is a binary function on the set of integers defined by: 

1 if N < -4 

0 otherwise. 


c = 


(146) 


Function c is visualized on Figure 23 It raises from 0 to 1 when the 


strategy par performs a lossy move in the level [Ig N\ , and drops from 1 to 
0 when the strategy win(Af) performs a lossy move in that level. 



10 20 30 40 50 60 70 


Figure 23: A discrete graph of function c. 


Proof. Since, by (121) page 


49 


,_. _ Pwin{iV)(iV-l) =F^^(JV-l)-A^,n(JV)(W- 

1), by virtue of (144), it suffices to prove that for any N > 2, ( 145[ ) is equal 
to 

pUB^N - 1) - max{ [lg(iV + 3)J, 3} + 3. (147) 

Once we pinpointed correct formulas, verification of that equality is easy. 

By Theorem 8.1 in [TOj, 

P^^(iV-l) = (2Af - 1) Llg(Af - 1)J _2Lig(^-bJ+2 + 4^ 
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thus ( 147[ ) is equal to 

{2N - 1) [lg(iV - 1)J - + 4 - max{ [lg{N + 3)J, 3} + 3. (148) 

It suffices to show that (145) is equal to ( 148| ) for all N >2. 


If 77 < 5 then min( [lg(A^ — 1)J, 2) = [lg(iV — 1)J, max{ [lg(iV + 3)J, 3} — 3 


= 0, and c = 0 so that (145) reduces to: 


2{N - 1) Llg(iV - 1)J - + [lg(iV - 1)J + 4, = 

{2N - 1) Llg(iV - 1)J - 2Lig(^-i)J+2 + 4, 


and so does (148). Thus (145) and (148) are equal in this case. 


If > 6 then min( [lg(iV — 1)J ,2) = 2, and max{ [lg(iV + 3)J, 3} — 3 = 


[lg(iV + 3)J — 3, so that (145) reduces to: 

2{N - 1) [\g{N - 1)J - 2Lig(^-i)J+2 + 6 + c = 

= {2N - 1) [lg(iV - 1)J - 2 L'g(^-hJ +2 _ Llg(iV - l)J + 6 + c, (149) 


and (148) reduces to 

{2N - 1) [\g{N - 1)J - 2L'g(^-i)J+2 - [lg(iV + 3)J + 7. (150) 


Thus, in order to show that ( |145 ) is equal to (148) it suffices to show that 
(149) is equal to (150), or, subt ractin g the common term {2N — l)[lg(A^ — 
1)J — _|_ g from (149) and (150), to show that 


that is. 


■Llg(iV-l)J+c=-Llg(iV + 3)J+l. 
Llg(iV + 3)J-Llg(iV-l)J=l-c, 


or, incorporating the dehnition (146) of c. 


Lig(iv + 3)j-Lig(iv-i)j = 


0 if iV < 2rig^l -4 


1 otherwise. 


(151) 


I will show that for every N > 3, (151) holds. 
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Indeed, if - 4 then \g{N + 3) < lg(2r'g^l - 1) < lg2r'g^l 

= [IgiV]. So, Llg(A^ + 3)J < [IgiVl, that is, Llg(A^ + 3)J < [IgiV] -1 = 
[lg(A^ — 1)J. Thus [lg(A^ + 3)J < Llg(^ ~ 1)J- Hence, since [IgA^J is a non¬ 
decreasing function, [lg(iV-|-3)J = [lg(A^—1)J and [lg(A^-|-3)J — [lg(iV—1)J = 


0, thus proving (151) in this case. 


If, however, N > 2^^^^^^ — 4 then N > 2^^^^^^ — 3 and Llg(iV -|- 3)J > 
[lg2rig^lj = LflgiVlJ = [IgiV] = [lg(iV - 1)J + 1 > [lg(iV - 1)J. Thus 
[lg(A^ -I- 3)J > [lg(iV — 1)J. Hence, Llg(iV -|- 3)J — [lg(A^ — 1)J > 1 and, since 
for TV > 3, Llg(iV + 3)J - Llg(iV - 1)J < 1, [lg(iV + 3)J - Llg(iV - 1)J = 1, 


thus proving (151[) in this case, too. 


Since there are no other cases, this completes the proof of (151). 


Thus (145) and (148) are equal in this case, too. 
of the theorem. 


This completes the proof 

□ 



Figure 24: A graph of function F’win(v)(TV — 1) of payoff for strategy win(A^). Also, a lower 
bound on the number of comparisons of keys performed in the worst-case by RemoveAll 
on an A^-element heap. 


Figure 24 shows a graph of the total payoff function Pwin(Ar) for win(iV). 
Obviously, Pwin( 7 V), given by (145), is a lower bound on the number of com¬ 
parisons of keys that RemoveAll performs in the worst case on an iV-element 
heap. I will prove in the next section that it is an upper bound on that 
number, too. 
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10. The proof of the winning 


Granted the optimality of par for each N = — 1, all heaps produced 

by gry are worst-case heaps as long as the corresponding pull downs are 
lossless. If no greedy strategy exists that would perform more consecutive 
lossless pull downs on any worst-case heap on — 1 nodes than gre does 

on the worst-case heap H 2 Ligivj_i then, taking into account that, by virtue of 
the equality ( 141[ ) page 65, gre loses no more than 1 credit relative to 
in the entire [IgiVjth level, all heaps produced by gre are worst-case heaps, 
thus making the combination par -|- gry an optimal strategy for N. 


So, the basic question that needs to be answered in order to decide the 
optimality of win(A^) for N is how many consecutive lossless pull downs a 
worst-case heap on 21-^®'^-! — 1 nodes may admit. I am going to show that no 
more than — 3, that is the same as win(iV) does admit on the heap 

H 2 LigAfj_i. From that the optimality of win(iV) for N will follow. 

If = 21-^®^-! then pulling down 1 yields 2([lgiVj — 1) credit, which is 
maximal, so it implements the desired lossless strategy for the level [Ig A^J. 

If A^ = 2L^®^J -|- 1 then pulling down 1 twice or pulling down 1 and then 
pulling down its new parent yields 2([lg77j — 1) -|- 2([lg77j — 1) — 1 = 
4[lgA^J — 3 credit, which is maximal, so it implements the desired lossless 
strategy for the level [Ig A^J. 

If A^ > -|- 1 then the hrst two pull downs in the last level [IgA^J 

determine the maximum credit that the best greedy strategy in that level 
can collect. 


Let p and q be the hrst two nodes that ended up in the last level [IgA^J 
as a result of the hrst two pull downs. (In the example of strategy gry for 
^ _ 2Ug^J -j- it would be 1 and the hrst node in level [Ig A^J — 1 after the 
strategy par was executed.) 

Consider heap that was produced after these two pull downs. 

How far any strategy could proceed without a loss of credit relative to the 
upper-bound payohP^^(2l-^®'^-l-|-l , m)? It turns out that the index of the hrst 
node r in the level [Ig A^J — 1 of heap that is less than the maximum 

of p and q (see Figure [25|) puts the limit on the number of consecutive lossless 
pull downs that the strategy in question can make. This intuitively obvious 
fact has a surprisingly unobvious proof, quite a typical sample of the hne- 
grain complexity of this subject matter. 
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r 

p q 


Figure 25: Example heap iF 2 Ligwj+i on N = _|_x nodes with nodes p, q, and r shown. 

Nodes p and q are the only nodes in level [Ig . Node r (less than maxjp, g}) is in the 
level [IgiVj — 1. The index of r is j, and the indicies of p, q are N — 1, N, respectively. 


Lemma 10.1. Let H be a heap on N = + 1 nodes, where N > 5 , 

and let p and q be its last two nodes. If j is the index of a node r in the 
level [IgiVj — 1 of heap H with r < max{p, g} then heap H does not admit 
a sequence of more than 2j — N consecutive lossless pull downs. 


Proof. Since r < max{p, g}, r is not a parent of p and g, the only nodes in 
the last level [IgiVj of H. So, r is not a parent of p or g. Thus r is a leaf in 
H, and so 

N < 2j. (152) 

Because [IgjJ, the level number of r, equals to [IgiVj — 1, 

Llg(2j + l)J = [lg2jJ = [IgiVj. (153) 


Let’s assert that the lemma is false. Let v be a sequence of 2j — Af + 1 
consecutive lossless pull downs executed on H. By (152), 2j — iV + 1 >2. 
Let 2 < 2i < 2j — Af + 1, let 77'' be the heap on 77 + 27 nodes produced by 
the first 27 pull downs (ui, ...,V 2 i) of v, and let 821 be the set of all ancestors 
(proper and improper) of the nodes in the last le vel K = [lg(Af + 27)J of 
heap H' \ with convention H' = H. By virtue of (153), 


A'= [IgVJ. 


(154) 


Since the number Af + 27 of nodes of is odd, the number of nodes in the 
level K of 
sibling in 77'^* 


is even. In particular, every element of level K in H' has a 
also an element of that level. 


First, I am going to show that a pair V 2 i-i,V 2 i of lossless pull downs 
may only pull down elements of 821 - 2 - Let s, t be the nodes pulled down 
by V 2 i-i,V 2 i, respectively. For the pull down V 2 i-i to be lossless, by the 

taking into account that 


Credit Loss Characterization Lemma 7.1 


page 


39 


the last level of each heap in question has at least 2 nodes so that each of 
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the conditions 
and 


111 


IV 


and (ii) of that lemma entails the disjunction of conditions 


s must be a sibling of a node in the level K of heap H' 


or 

a parent thereof. In particular, it must be an element of 821 - 2 - For the pull 
down V 2 i to be lossless, by the Credit Loss Characterization Lemma [TA, t 
must be a sibling of a node in the level K of heap H' ^ or a parent thereof. 
The only such node that can possibly be not in 821-2 is the (new) parent of 
s in * , which, however, has one child only (s, that is) because H' * has 
an even number of elements and so its last level has an odd number of nodes, 
so that the last element (s, that is) of the last level of H' * cannot have a 


sibling. Therefore, by , by the Credit Loss Characterization Lemma 7.1, the 
(new) parent of s in 77' * cannot be pulled down or otherwise V 2 i would be 
lossy. 


Second, I prove by induction on i that for every i with 0 < 2 % < 2j —A^+1, 
the two smallest elements of 82 % are {p, q}. Indeed, p and q are the only nodes 
in the last level of the heap H = H' , so S'o = {p, q} U Aq, where Aq is the 
set of proper ancestors of {p, q} in the heap 77. Since no element in Aq is 
less than max{p, q}, the two smallest elements of S'o are {p, q}. For i > 0, we 
have 821 = 82 i -2 U A 2 i, where A 2 i is the set of proper ancestors of the nodes 
in the last level of heap H' \ because moves V 2 i-i,V 2 i pulled down elements 
of 821 - 2 - Since no element in A 2 i is less than max{p, q}, the two smallest 
elements of 821 are the same as the two smallest elements of 821 - 2 , that is. 



Figure 26: Example heap Ff 2 Li 8 «J+i of Figure 25 page 72 with r’s future children s and t 
(not necessarily different from p, q) shown. The nodes s and t are pulled down by the last 
two moves V 2 j-N and V 2 j-N+i the of sequence v. Their new indicies (after the said pull 
downs) are 2j and 2j + 1, respectively. If the first 2j — N pull downs of v are lossless then 
2j + 1 — A^th pull down V 2 j+i-N of t is lossy. 


Because r < max{p, q}, r ^ 82 m for any 0 < 2m < 2j — N + 1. Therefore, 
none of the moves of the sequence v pulls down r, which stays put during the 
execution of v. Thus the last two moves V 2 j-N and V 2 j-N+i the of sequence 
V result in attaching the (new) children s, t to node r (see Figure [2^. They 
must both pull down elements from the set 82 j-N+i- Since there is at most 
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one element of S 2 j-N+i that is less than r and qualifies as its child, such a 
case is clearly impossible. This contradicts the assertion that the lemma is 
false. □ 


The following technical lemma uses function of accumulated loss of 

It derives properties of some 


credit given by the equation (115) page 48 


heaps H and their residua H from properties of virtually unrelated heaps G 
and their residua G of the same sizes as H and H, respectively. Although 
the facts spelled out by this lemma may seem inconsequential and intuitively 
obvious, and have elementarily-algebraic and straightforward proofs, they 
have allowed me to cut the lengthj^ of some of the lengthiest proofs of some 
fundamental results in this Section. 


Diagram Lemma 10.2. Let H and G he heaps on N > 2 nodes, and H 
and G he their respective residua on 2 < M < N nodes. If H is a worst-case 
heap then the following are true: 

(i) If G = H then Ap^j < Aqq. 

(ii) If G is a worst-case heap and Ap^ > Aqq then H and G are a worst- 
case heaps and Ap^ = Aqq. 

(iii) If G is a worst-case heap hut H is not, and Aqq < 1 then G is a 
worst-case heap. 

(iv) If G is a worst-case heap hut G is not, and Aq^ < 1 then H is a 
worst-case heap and Ap^ = 0. 

Proof. Consider a differential diagram (a weighted directed graph) D, an 
example of which is shown on Figure]^ whose nodes H, G, ..., are heaps 
and weights of edges {H, G) are differences 


dG,H — G’i 


RemoveAll 


oW-a 


RemoveAll 


o(G'). 


(155) 


where C'RemoveAii()(-^) is the number of comparisons of keys performed by 
the execution of X.RemoveAll() on heap X. Obviously, any differential di¬ 
agram D commutes, that is, any two paths in D with the same source and 
destination have equal weights. 

In particular, the diagram with heaps G, H on N nodes each, and their 
respective residua G and H on M < N nodes each, visualized on Figure 
commutes. 


28 


®°By about a factor of four or so. 
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F 


Figure 27: An example of a commuting diagram; dp,a + dG,H = dpm- 

dam 


p^^-A 


G,G 



p^^-A 


H,H 


Figure 28: Another commuting diagram. 


Here N — 1) is the upper bound on credit for the sequence 

of pull downs that produces an iV-element heap from and M-element heap, 
and Aqq and Ap^j are accumulated losses of credits given by the equation 


(115) page 48, so that 


P 


UB 


Hg = 


[by (118) page 49 


— P- — 

~ ^G,G ~ 


[by (155) and (113) page 48 




and 


P 


UB 


GG-! 


^H,H ~ ^H,H- 


Since the diagram on Figure [^ commutes, we have: 

p’^ — Aqq + dc^H = dQjj = dQp + 
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which proves 

dG,H + Ajjfj = (Iqp + Aqq. 

Let’s assume that if is a worst-case heap. In such a case, 

dG,H > 0 . 

If G = then = 0 and, by (156), 


(156) 


( 157 ) 


^H,H ~ ^G,H + ^G,G dG,H — ^g,G dG,H < ^G,G> 


that is 

This proves 
If 


^H,H — ^G,G- 


^H,H — ^G,G- 


(158) 


(159) 


then by (156) 


[by dl^ ] 
that is. 


'^gA ~ dG,H + ^h,h ^g,g — dG,H ^ 

> 0 , 


dak — 0- 


Thus if G is a worst-case heap then if is a worst-case heap, too. Hence, 


dcfi — 0- 


By[^ inequality (158), and inequality (159) 


^kH — ^G,G- 


(160) 


(161) 


By virtue of ( |156| ) and ( |160[ ) and ( |161[ ), 

dG,H = 0 . 


Therefore, G is a worst-case heap. This and (161) prove 
If G is a worst-case heap but H is not then 


11 


d^H ^ 1 - 


(162) 
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Therefore, if < 1 then (156) yields: 
dan = d 


G,H + ^G,G “ d^H,H < -1 + 1 - < 0. 


Since 77 is a worst-case heap, so is G. This proves (iii) 


If G is a worst-case heap but G is not, and < 1 then, by (iii) 
a worst-case heap, and so 

d^H = 0 - 

Also, since H is a worst-case heap, 

dG,H > 0. 


H is 

(163) 

(164) 


From (163) and (156) I infer 


that is. 


[by ( [lM| )] 
Hence, 
that is. 

This proves 


dG,H + Apjj — Aqq, 


^H,H — ^G,G dc^H < 


< Aqg < 1- 


A 


H,H 


< 1, 


^H,H = 0 . 


IV, 


□ 


The following lemma imposes rather rigid limits on how far one optimal 
strategy can fall behind another. It is a consequence of the existence of 
strategies that are lossless up until level 3 and, like the substrategy (1, 2,...) 
of alternating pull downs of 1 an 2, lose only one credit per level relative to 
their respective upper bounds — 1, 2^^^ — 2) for each level K > 3, 

with which optimal strategies must compete in order to win. 


®^One can directly compute P™(2^ — 1, 2^+^ — 2) for any K > 1 as 

9K + 1_9 97<r + l_9 

= 2Llog(*+1)J - 3 = 2K — 2> = — 3. Intuitively speaking, there 

are 2^ pull downs between 2-^ — 1 and 2^+^ — 2, each collecting maximal credit of 2K, 
except for the first one that loses 2 credits relative to 2K and for the second one that that 
loses 1. 
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Competition Lemma 10.3. For every worst-case heap H on N > 2 nodes, 

(i) the last level of H can lose no more than 1 credit, and 

(ii) if the last level of H is lossy than heap H’s complete residue H on 

M = — 1 nodes is a worst-case heap. 


Proof. Let 77 be a worst-case heap on iV > 2 nodes, 77 be the 77’s complete 
residue on M = 21-^®^-! — 1 nodes, and be the loss of credit, relative 
to the upper bound — 1), incurred by the sequence of N — M 

consecutive pull downs that produce 77 out of 77. Since N < — 1, 

we have: 

_/Y _ _ 2^8'^! -)- 1 = 2^^®'^-! 

that is 

N-M<M + 1. (165) 

Consider a sequence (1,2,...) of N — M alternating pull downs of Is and 
2s consecutively applied to a heap G on M nodes. Let G be the heap on 
N nodes that is a result of that application and Aqq be the loss of credit, 
relative to the upper bound N — 1), incurred by that sequence. By 

virtue of the pq Lemma 7.2 (I)] page 43, applicable because of the inequality 

(@, 

Ago < 1. (166) 


Let G be equal to 77. Application of Diagram Lemma 


10.2 


yields 


that is, by virtue of (166), 


This completes the proof of 


^H,H — ^G,G’ 


AftH < 1- 


11 


If the last level of 77 is lossy then 


that is, by virtue of (166), 


^G,G — ^H,H- 


(167) 


Let G be a worst-case heap. By virtue of Diagram Lemma 


10.2 


applicable because of (|167|) 77 is a worst-case heap. This completes the proo 
of 


11 


□ 
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Lemma 10.4. Heap H on N = nodes, where N >2, admits lossless 

sequence of of N — 1 consecutive pull downs if, and only if, its last node H[N] 
is 1 and its parent H[^] is 2. 

Proof. (<^=) The sequence of — 1 consecutive alternating pull downs 
(1, 2,1,2,1,2,1) is an example of the said lossless sequence. 

(^) If 1 is not in the level Ig N of heap H then it has to be pulled down 
at certain point before completion of any sequence of iV — 1 consecutive 
pull downs. Because 1 has no children, that pulling down cannot be lossless 
except in a heap on 77 = 2— 1 nodes, which is not the case here. So, 
H[N] = 1. The only lossless pull downs that can be applied to H in this case 
is pulling down 1 or its parent 77[y]. Each of those will move the parent of 1 
from level IgiV — 1 down to level Ig If the parent 77[y] of 1 in heap H is 
not 2 then 2 resides somewhere else as a leaf in the level Ig A^ — 1 of heap H 
after the index [yj, and will terminate any lossless sequence of pull downs 
right after 1 became the left child of 2. So the said parent 77[y] must be 2 
in order for the said sequence of of 77 — 1 consecutive lossless pull downs to 
exist. □ 


Lemma 10.5. If heap H on N = nodes, where N > A, admits se¬ 

quence of N — 1 consecutive lossless pull downs then in any such sequence 1 
has to be pulled down at least once and 2 has to be either pulled down at least 
twice or demoted at least once and pulled down at least once. 


By Lemma 


10.4 


the last node H[N] is 1 and its parent 77[y] is 


Proof. 

2. The only lossless pull down that heap H admits is pulling down 1 or its 
parent 2, any of which creates a heap 77' on A^ +1 nodes whose last two nodes 
(the only two nodes in the last level k of heap 77) are 1 and 2. The parent a 
of 2 in heap 77' is the same as the parent of 2 in heap 77 as it got demoted 
with the demotion of 2 caused by the pull down of 1 or with the pull down 
of 2. Since 1 and 2 are siblings in heap 77', a is also the parent of 1 in heap 
77'. The right sibling of 2 in heap 77 is not 1 because 1 is the left child of 2 in 
heap 77. So, the right sibling of 2 in heap 77 is greater than 2 and, therefore, 
the parent a of 2 in heap 77 is greater than 3. Thus 3 is not the parent of 1 
or 2 in heap 77', so 3 must be a leaf in heap 77' and as such it must belong to 
the level [IgiVj — 1 in that heap. As a leaf in heap 77', it cannot be pulled 
down without a loss. So it will remain in the level [IgTVj — 1, acquiring, 
eventually 1 and 2 as its children since no other node can be a child of 3 and 
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all nodes in the level [IgA^J — 1 of any heap on — 1 nodes produced 

from H by any sequence of — 1 pull downs has two children each. So, in 
order for the sequence of — 1 to be lossless, both 1 and 2 must be pulled 
down or otherwise they won’t become the children of 3. 

Whatever was the case in the sequence described above, 1 is pulled down 
at least once (while making it a child of 3), and 2 is either demoted once (if 
1 is pulled in the first move) and pulled down once (while making it a child 
of 3) or pulled down twice (in the first move and while making it a child of 
3). □ 


Lemma 10.6. If heap H on N = nodes, where N > A, admits lossless 

sequence of of N — 1 consecutive pull downs then any such lossless sequence 
produces a complete heap H' on — 1 nodes whose first node of its last 

level is greater than or equal to 4. 


Proof. By Lemma 10.4, II[N] = 1 and H[^] = 2. Therefore, neither 1 nor 
2 are children of node 3 in heap iL (1 is the child of 2 in iL and sibling of 2 
in H is greater than 2) nor its descendants. 

By Lemma 10. 5[ node 1 cannot maintain its index N because it is pulled 
down from that index. Node 2 can only be moved at index N as a result of 
its demotion. It cannot maintain its index N because is pulled down from 
that index. So the first node of its last level of heap H' is neither 1 nor 2. 

Node 3 gets never moved at index N, because it is not an ancestor of 
nodes 1 and 2. So, it cannot be there in heap H'. (An analysis of the part 


of the proof of Lemma 10.5 regarding position of 3 in heap H could yield a 
more refined argument.) 

So, the said first node in the last level of heap H' is neither 1,2, nor 3. 
Hence, it is greater than or equal to 4. □ 


Lemma 10.7. If heap H on N = 2^^^^^^ — 1 nodes, where N > 3, admits 
a sequence of N + 1 consecutive lossless pull downs then any such lossless 
sequence produces a complete heap H' on — 1 nodes whose first node 

of its last level is greater than or equal to 4. 


Proof. If H admits a sequence of A^ + 1 consecutive lossless pull downs 
then the heap G that is produced by the first of those pull down admits N 
consecutive lossless pull downs. Since the size of G is M = 21^*®'^^ = 
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and N = M — 1, Lemma 10.6 applies, a consequence of which yields the 
thesis of this lemma. □ 

The following seemingly inconsequential Lemma is critical for my proof 
that win(iV) is an optimal strategy for N. 

Worst-case Heap Characterization Lemma 10.8. The first node of the 
last level of any complete worst-case heap on more than three nodes is greater 
or equal to 4. 

Proof Let if be a complete worst-case heap on 2*^ — 1 nodes, where k > 3. 
I will prove by induction on k that the first element of the last level 

k — 1 of H satisfies this inequality: 


H[2'‘-^] > 4. 


(168) 


There are only four worst-case heaps H on 7 nodes, and for each of them 
H[4:] =4 or Lf[4] = 5. (One way of showing it is to inspect the results of 
all possible 6-elements sequences of lossless pull downs.) Thus the lemma’s 


thesis (168) is true for k = 3. 


For the inductive step, let’s assert that k is the smallest number greater 


than 3 for which the lemma’s thesis (168) is false. Let v be a strategy that 
produces H in its 2^ —2nd move V 2 k_ 2 , and H = {Hi \ i G a;’'') be the sequence 
of heaps produced by v. We have H = H 2 k_i. By the above assertion, we 
have: 

H2k_i[2^-^] < 3. (169) 


By Lemma 10.7, if the last level k — 1 of heap iL 2 '=-i were lossless, that 


is, if all pull downs n 2 fc-i_i through n 2*-2 earned maximal credits, then the 
first element H2k_i[2^~^] of level k — 1 would have to be at least 4, which, by 
(169), is not the case. So, the last level k — 1 of heap 772'=-! is lossy and, by 
Competition Lemma 10.3 (ii), the previous complete heap 772fc-i_i must be 


a worst-case heap. By the induction hypothesis, the first node 772fc-i_i[2^ 
of the last level k — 2 of heap 772fc-i_i must satisfy this inequality: 


772.-i_i[2'=-2] > 4. 


(170) 


Because of that, the only scenario under which the inequality (169) is 
satisfied is that the node 772fe_i[2*’“^] of heap H 2 k_i was pulled down there by 
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the move V 2 k-i_i 
The heap H 2 k-i 
V 2 k-i-i has only 

(169) , satishes p 

(170) . 


executed on heap H 2 k-i_i and never pulled down thereafter, 
(not to be confused with heap i72'=-i) produced by move 
one node p = H 2 k-i[ 2 ^~^] in its last level k — 1, which by 
< 3. Its parent, q = H2k-i[2^~‘^], satishes g > 4 by virtue of 



Figure 29: Example heap H 2 k-i on 2^ ^ with nodes a, b, p, and q shown. Node p is the 
only node in level k — 1. Nodes a, b, and q (the parent of p) are in the level k — 2. 


Let a and b be the elements of the set {1,2,3} \ {p}. (For algebraists, 
one can put a = pmod 3 + 1 and b = (p + 1) mod 3 + 1.) An example of 
such an arrangement is shown on Figure]^ Neither a nor b is the parent q 
of p in heap H 2 k-i since g > 4. So, none of them can have more than one 
child, which means that they must belong to level fc — 2 of heap H 2 k-i since 
all nodes above that level have two children each. Because the only element 
of level fc — 1 of heap H 2 k-i is p, and neither a nor b is the parent of p, both 
nodes a and b are leaves in the level k — 2 of heap H 2 k-i and are less than any 
other node of heap H that is pulled down in moves {v 2 k-i, ■■■,V 2 k_ 2 ) (recall 
that node p is not to be moved from its index 2^“^). Therefore, neither 
of them can acquire two children. Since all nodes in level A: — 2 of heap 
H 2 k_i have two children each, they both must be pulled down, eventually, 
from the level k — 2 to the level A; — 1 by some moves in {v 2 k-i, ...,V 2 k_ 2 ), 


as shown on Figure 30, each of those pull downs, by virtue of (83) and (85) 


Figure 30: Heap iF 2 *'-i on 2^ — 1 nodes produced by the sequence of pull downs 
(u 2 fc-i,..., U 2 f=_ 2 ) from example heap H 2 k-i of Figure 
level k — 1. 


29 


Nodes a, b, and p are in the 
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page 38, earning at most 2(k — 1) — 1 credilj^ that is, each causing a loss of 
credit to the maximum of credit 2{k — 1). Thus the sequence of pull downs 
{v 2 k-i, ■■■,V 2 k- 2 ) must lose more than one credit and, therefore, the sequence 
of pull downs {v 2 k-i_i, ■■■,V 2 k_ 2 ) must lose more than one credit. Hence, by 
Competition Lemma 10.3 (i) and (ii), the resulting heap H = H 2 k_i is not 
a worst-case heap, contrary to the assertion made at the beginning of this 
proof. □ 

At this point I am ready to conclude a fundamental fact that comes handy 
while proving the optimality of strategies par and win. 


Theorem 10.9. Let H be a complete worst-case heap on N = — 1 

nodes, where N > 7. H does not admit a sequence of more than N — 2 
consecutive lossless pull downs. 


Proof. Let 2 < K = [IgA^J and H he a complete worst-case heap on 
N = — l nodes, where K >2. By the Worst-case Heap Characterization 

Lemma 10.8, the first element p = H\f2^] 
larger. Let v = {y. 


of the last level Lf of if is 4 or 
1 < i < n) be a sequence of consecutive lossless pull 
downs that heap if can admit, and let n be the length of v. Since the first 
two pull downs on a complete heap can clearly be lossless, I may assume 
without a loss of generality that n > 2. Let for any 0 < i < n, H'" be the 
heap on iV -|- i nodes produced by the first i pull downs (ui, ...,Vi) of v, with 
convention H' = if. In particular, H' = H' is the heap produced by pull 
down vi executed on ff, and if" = if' is the heap produced by pull down 
V2 executed on if'. 


Move Vi can pull down, and losslessly so, any node of the last level K of 
heap if. In particular, it can pull down p, in which case p will become one of 
the two nodes of the last level iC -|- 1 of heap if". If Vi pulls down a node q 
different than p then the second move V 2 , in order to be lossless, must either 
pull down the parent p of g in heap if', or pull down q, which will case a 
demotion of its parent p in heap if'. In either case, p will end up in the level 
ff -|- 1 of heap if". So, let p and q be the nodes of level if -|- 1 of heap if". 
Since p > 4, we have max{p, g} > 4. Let’s see where the nodes 1,2 and 3 
can reside in heap if". 


®^Each of them will earn only 2{k — 2) credit if the larger of the two gets pulled down 
before the smaller of the two. 
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Each element of level K — 1 oi heap H” and above that level has two 
children, so neither node 1 nor node 2 can be there since none of them can 
have two children. So, nodes 1 and 2 reside somewhere in levels K and K + 1 
of heap H”. 

If node 3 does reside above level K of heap H” then 3 must have two 
children. These can only be 1 and 2, and if so then 1 and 2 must be siblings 
and, therefore, must not reside in different levels of heap H". Level K + 1 
of heap H" consists of two nodes p and q, one of which, p, is larger than or 
equal to 4, so both 1 and 2 reside in level K of heap H" in such a case. 

If node 3 does not reside above level K of heap H” then it resides in level 
K or K + 1 oi H”. In this case, all three nodes, 1, 2, and 3 reside in levels K 
and K + 1 oi heap H”. Level K + 1 oi heap H” consists of two nodes p and 
q, one of which, p, is larger than or equal to 4, so at least two of the nodes 
1, 2 and 3 reside in level K heap H" in such a case. 


Whatever the case, at least two of the nodes 1, 2 and 3 reside in level K 
of the heap H". Let j be the minimum of the indices of those nodes 1, 2 and 
3 that reside in level K of the heap H". Because the maximum index in level 
K of H" is N and there are at least two such nodes, j < N — 1. 


We have: H"[j] < 3. In particular, H"[j\ < max{p,g}. Heap H" has 
+ 2 nodes so, by Lemma 10.1 it does not admit a sequence of more 
than 2j — (A^ + 2) consecutive lossless pull downs. Now, 2j — {N + 2) < 
2N — 2 — {N + 2) = A^ — 4. Therefore, the sequence v of consecutive pull 
downs has the length n < A^ — 4 + 2 = N — 2. Thus heap H does not admit 
a sequence of more than N — 2 consecutive pull downs. □ 


It may seem that the preceding technical lemmas, particularly. 
Lemma 10.9, actually nailed the proof that win(A^) is an optimal strategy 
for N^ but such an idea is a bit deceitful as it is tacitly based on an un¬ 
pronounced assertion that only a quasi-greedy strategy whose — 2nd 

move produces a worst-case heap , which win(A^) is an instance of, can be an 
optimal one. Unfortunately, there exist other strategies that are optimal for 
N^ for instance, strategies that lose to win(A^) in — 2nd move but are 

lossless afterwards so that, if A^ > 21^^^^^^ —4, they can catch on with win(A^) 
at the end of the last level [Ig A^J of the worst-case heaps that they produce. 
Here is an example of such an optimal strategy. 
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Example 10.10. The heap G4 = [4,3,2,1] is a worst-case heap and is 
not a residue of any complete worst-case heap. However, it can he pulled 
down onto a worst-case heap of arbitrary large size via, for instance, Gj = 
[7, 5,6, 3,4,1,2] that, of course, is not a worst-case heap. The following very 
simple strategy z{N), optimal for iV = / +1 and 1 + 2, where I = 2^^^^'^^ — 4 
may be constructed from Gy. Beginning with move 1, play a version of the 
strategy win (TV) on Gy, that is, in any level beyond level 2, except for the 


last level if it is not full, play a version of the strategy par^ that would as 


sure that 3 resides at the first index of the last level of any complete heap 
Gm created this way, and 1 and 2 reside at the last two indices of that level. 
If the last level of G^ is not full then play the strategy gre in that level. 
No complete heap produced with par beginning from its 7th move on heap 
Gj is a worst-case heap, simply because Gy is not and par loses 1 credit in 
every level beyond level 2. However, the restriction imposed by the Theo¬ 
rem lO.f^ does not apply, and, as it turns out, all complete heaps G 2 K- 1 , 
where 3 < iV < IgiV, produced by z{N), admit more than 2^ — 3 loss¬ 
less pull downs eac/j^ This allows z{N) to catch on with win(A^) in move 
m = 2 !^^^-^^ — 4 and recover the 1 lost credit from its move 6 (that created heap 
Gj). For example the following heaps created by z(14) in its 12th and 13th 
moves are worst-casG^ heaps: G 13 = [13,12,7,9,11,6,2,5,8,4,10,1,3] and 
Gi 4 = [14,12,13, 9,11, 7,2, 5, 8 ,4,10, 6 , 3,1]; their only worst-case residua are 
heap Gq = [ 6 , 5, 2, 3,4,1] and the residua of Gg 


Below I prove, by neat induction, an optimality criterion for strategy par, 
from which I am going to conclude the optimality of win(iV) for any given 
N. 

The 1®* Optimality Theorem 10.11. Strategy par is optimal for N if, and 
only if, 

(171) 


®^z(7V) is not optimal for any other N > 7. 

®‘^Just keep pulling down 1 and 2 indefinitely. 

852^ _ pg exact; 1 and 3 can be pulled down repeatedly without a loss of credit 
until 1 becomes the first child of 2 and 3 cannot be pulled down. 

®®But not hereditary worst-case heaps. 

®^Similar restriction applies to all heaps Gn on N > 7 nodes produced by z{N); in 
particular, no has more than 7 worst-case residua. 
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Proof. The only if part follows from the fact that strategy win(iV) beats 
par at — 1 |^for all N < — 4. 

I will prove the if part the by induction on [IgA^]. 


Since, by Theorem 9^ page the hrst 7 moves of par are lossless, par 
is an optimal strategy for 2 < N < 8, that is for 1 < [IgiV] < 3. 

Let [IgiV] > 3. The inductive hypothesis implies that par is an optimal 
strategy for N = — 1 Let M = 2^^^— 3. If the heap G created 


by par’s hrst M — 1 moves is a worst-case heap then, since by Theorem |9.2 
page 56 all the moves K < 2nsK] of 

par are lossless, in particular, moves 
M,M + 1, and M + 2 are, the heaps G', G", and G”' created by par’s hrst 
M, M -\-l, and M + 2 moves (respectively) are worst-case heaps, too. In such 
a case, par is an optimal strategy for all K with iL > 2^^^^'^^ — 4 as long as 
[IgiL] = [IgiV] -|- 1 1^ which observation completes the inductive step. So, 
all I have to prove at this point is that G is a worst-case heap. 

Let us suppose to the contrary that G is not a worst-case heap. Let G be 
G’s residue on N nodes. Obviously, G is created by the hrst N — 1 moves of 
par. Also, 

(172) 


Ago = 1- 


Since par is optimal for A^, G is a worst-case heap. Let iL be a worst- 
case he ap on M nodes and H be its residue on N nodes. By the Diagram 
Lemma 10.2 (iv) page 


74, is a worst-case heap and 


A«« = 0. 


(173) 


Thus 77 is a complete worst-case heap on 77 > 7 nodes that admits 
M - N = 2^'^^^+^ - 3 - 2^^®^^ -7 1 = - 2 = A^ - 1 


consecutive lossless pull downs, contradicting Theorem 10.9 Therefore, G is 
a worst-case heap. □ 

Since win(77) coincides with par for N > 2^^^^^^ — 4 and beats par at A^ — 1 
for N > — 4, the fact that par never loses more than 1 credit to an 

optimal strategy allows me to easily conclude an optimality of win(A^). 


®8ln first N - 1 moves, that is. 

s^Because for [Ig > 2, [lg(20g^l - 1)] = [IgiV] and 20g^l - 1 > 2rig(2^‘®"^-i)l _ 4 . 

®°Because for |"lg A^] > 2, |"lg(M -l-i)] = |"lg A^] -|- 1 and M + i > 20g('^+®)l _ 4 are true 
if, and only if, i = —1, 0,1, 2. 
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The 2"'^ Optimality Theorem 10.12. For every N > 2, win(iV) is an 
optimal strategy for N. 


Proof. The first N — 1 moves of win(iV) coincide with the first iV — 1 moves 
of par if iV > — 4, thus for every — 4, win(iV) is an optimal 

strategy for N. 


Let 


TV < 2^^^^^ -4, 


(174) 

1 moves 


and let 77 be a heap on 21-^®'^-! nodes produced by the first 21-^®'^-! 
of win(77). Since H is also a heap produced by the first 2L*§'^J — 1 moves of 
par, by The W O ptimality Theorem 10.11 , 77 is a worst-case heap. Now, by 
the equality (141) in Theorem 9.7 page 65, all moves 21-^®through — 1 of 
win(77) are lossless. Therefore, the heap G that is produced by those moves 
is a worst-case heap, too. Since G is a heap produced by produced by the 
first N — 1 moves of win(A^), by virtue of Theorem 8.3 page 51, win(A^) is a 
winning strategy for N. □ 


10.1. A comment on the proof of optimality 

Basically, all the trouble that we went through in this Sectiorj^ was to 
prove the following deceitfully simple fact that entails the optimality of strat¬ 
egy win (77) for any N >2. 


Singularity Theorem 10.13 

admits a lossless pull down 


No worst-case heap on N = 2^^^^'^^ —4 nodes 


92 


Proof. Assume to the contrary that A^ = 21^*®^^ — 4 and 77 is a worst-case 
heap on N nodes that admits a lossless pull down. Let 77 be a residue of 77 
on M = nodes, G be the heap on M nodes created by the first M — 1 


moves of strategy win(77) and G be the heap on N nodes created 
TV — 1 moves of strategy win(77). By The P* Optimality Theorem 


3 y the first 
10. Ill G is 


®^Now, after I am done with all the details of my analysis, I can sense a trace of frustra¬ 
tion, to which I can relate, in the comment that Donald Knuth wrote in |^: “Algorithm 
H is rather complicated, so it probably will never submit to a complete mathematical 

analysis [...].”. _ 

®^The following stronger version of the Lemma 10.13 can be proved: No worst-case heap 
on N nodes admits a lossless pull down if, and only if, N = 20sN — 4. 
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a worst-case heap. By the Diagram Lemma 10.2 (ii) page 74, all the N — M 
pull downs that reconstruct heap H from its residue H are lossless, thus G 


admits A^ — M-l-1 = M — 1 lossless moves, contrary to Theorem 10.9 page 83 


□ 


Note. The Singularity Theorem 10.13 can be expressed in the following 
equivalent form without any reference to pull down. 

Theorem 10.14. For every N = — 3 and every heap H on N 

nodes, the heap i7.RemoveMax() is not a worst-case heap or 

CRemoveMaxO (77) < C'^moveMax() (^) • 


Proof of the equivalence of Theorems |10.13| and |10.14[ If 77 is a 

worst-case heap on _ 2 rig^l _ 4 nodes that admits a lossless pull down p 
then the heap H' that is the result of application of p to 77 is a worst-case 
heap on N' = _ 3 nodes with C'RemoveMax()(7r') = C'^“oveMax()(^)- 

This would make Theorem 110.141 false. 

If 77 is a heap on N = 2 l^s(^-i)l _ 3 nodes and 77' = 77.RemoveMax() is a 
worst-case heap with CRemoveMaxO (77) = {N) then 77' is a worst-case 

heap on N' = 21^^^-^'^ — 4 nodes that admits a lossless pull down. This would 
make Theorem 110.131 false. □ 

One could try to find a strategy that makes a move with a loss of 1 credit 


only when the Singularity Theorem 10.13 mandates a loss; such a strategy 
would automatically be optimal. Its loss of credit function A* would be de¬ 
fined by the equality (129) page 57 and visualized on Figure [Toj Unfortunately, 
such a strategy does not exist or otherwise, as a winning strategy for all N, 
it would produce infinitely many hereditary worst-case heaps while there are 
only 1017 of them, as it has b een demonstrated in [Appendix B However, 


10.13 


dictates that -^*(*) is a lower bound on 

As(i) for the first 77 — 1 moves of any strategy 


the Singularity Theorem 
the accumulated loss 
s. This fact, taking into account that, by virtue of Lemma 9.8 page 66 , the 
strategy win (TV) actually reaches the said lower bound, leads to a weaker 
criterion of optimality of a strategy, from which the optimality of win (TV) for 
TV follows. 


®^That equality is equivalent to this slightly longer condition: TV = 25s^l — 3 and 
N ^5. 
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^ A Lower Bound Lemma 10.15. For every strategy s and every N >2, 


N 


N 




(175) 


7=2 


7=2 


Proof by induction on N. For N < 12, 'Yl!i =2 ~ 0 thus the inequality 


(175) holds. This completes the basis step. 


For the inductive step, let us assume that the inequality (175) holds for 
some > 12 and every strategy s. I am going to show that for every strategy 

s, 

N+l N+1 

(176) 

i=2 i=2 

If + 1 7 ^ — 4 then, by (|l29[) page 


57, A*(A^ + 1) =0 and the 


inequality (176) follows from (175). 

If A^ + 1 = — 4 then, by (129), \*{N + 1) = 1. In such a case, 

let us assume to the contrary that there exists a strategy s such that 

N+l Af+1 N 

As(*) < E ^*(') = E ^*(') + 7- (177) 


7=2 


7=2 


7=2 


From (177) and (175) I infer Xs{N + 1) =0, which implies that 

N+l 


N 


N 


5^A.(*) = 5^A.(i)<5^A*(*) + l, 


7=2 


7=2 


7 = 2 


or 


N 


N 


5^A,(i)<5^A-(i), 


(178) 


7=2 


7=2 


and, by virtue of the Singularity Theorem 10.13 that the heap 
H = <^(si^Ar_i) m on A^ nodes that has been created by s’s hrst A^ — 1 


moves is not a worst-case heap. Thus, by Theorem 8A page s is not 
optimal for N and, therefore, for some strategy u, 

N N 

5;A.(i)>^V(i), (179) 


7=2 


7=2 


94 


The function tX' has been defined on page 


37 
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This, by virtue of (178), yields 

N N 

i=2 i=2 

and, therefore, contradicts the inductive hypothesis. 
The ^ A Lower Bound Lemma yields the following. 


□ 


Optimality Criterion 10.16. A strategy s is optimal for N > 2 if, and 
only if, 


N-l 


N-l 




i=2 


i=2 


95 


(180) 


Proof. By the Lemma |8.2| page s is optimal for > 2 if, and only 
if, for every strategy u. 


N-l N-l 


5^A.(i)<^A„(»). 


Thus if (180) holds then, by the ^ A Lower Bound Lemma 10.15, s is optimal. 


If s is optimal for N then, by the Lemma 8.2 


Af-l N-l 

^ ^ -^5(0 — ^ ^ Awin(V )(0 ~ 

1=2 i=2 


[by the equality (142) in the Theorem 9.8 page 66 


N-l 




which, by the ^ A Lower Bound Lemma 10.15, yields (180). 


□ 


The Optima lity Criterion 10.16 may be used to prove the Singularity 
Indeed, if if is a worst-case heap on _ 2 rig^l _ 4 nodes 


Theorem 10.13 


®®Which, by virtue of Lemma 


9.3 


page 


57 


is equal to max{ [lg(iV -|- 3)J, 3} — 3. 
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that admits a lossless pull down, H is the heap produced from H by that 
lossless pull down, and s is a strategy that produces H in its hrst N moves 
then, by the Optimality Criterion, 


Since As(iV) 


N-l N-1 

i=2 i=2 


0 and \*{N) 


1 , (181) implies that 


N N 


5^A.(i)<5^A-(i). 


(181) 


thus contradicting the Optimality Criterion. 


Therefore, if one takes a proof Lemma 9^ page ^ for granted, the Op¬ 
timality Criterion and the Singularity Theorem have proofs of roughly the 
same complexity since they can be easily derived one from another, as I have 
shown above. 


Providing a straightforward proof of the Singularity Theorem 10.13 that 

(^nor on Theorem 


does not depend on The 1st Optimality Theorem 10.11 


10.9 that was instrumental in proving it) would constitute a signihcant sim- 


plihcation of my proofs of the main results of this paper. 


11. The worst-case number of comparisons for RemoveAll 

At this point, I have all the facts needed to conclude the fundamental 
result of this paper; a formula that gives the exact number of comparisons of 
keys performed in the worst case by the RemoveAll(), visualized on Figure [24| 
page[^ 

Theorem 11.1. For every natural number N > 2, the number 
of comparisons of keys performed in the worst case by the H.RemoveAll() on 
any heap H of size N is equal to: 

2{N - 1) [lg(iV - 1)J - 2 L'g(^-bJ +2 + inin( [lg{N - 1)J, 2) + 4 c, (182) 
where c is a binary function on the set of integers defined by: 

( 1 if N < 2rig^l -4 


c = 


0 otherwise. 


(183) 
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Proof. By The 2"'^ Optimality Theorem 


strategy for N, so, by Theorem 8.3 


page 


51 


by the formula (145) in the Theorem 9.10 page 68 , for its hrst N — 1 moves is 


10.12 page 87, win(A^) is an optimal 


the payoff Pwin(v)(77 - 1 ), given 


equal to the number of comparisons of keys performed in the 

worst case by the H.RemoveAll() on any heap H of size N. Since the formulas 
(182) and (145) are identical, the thesis of this theorem follows. □ 


Example of a 500-node worst-case heap for RemoveAll, created by my 
Java program, is included in [Appendix A. 3] page [TTO 


12. The worst-case number of comparisons for Heapsort 

Adding the formulas for the exact numbers of comparisons of keys per¬ 
formed in the worst case by the MakeHeap and RemoveAll() yields the exact 
numbers of comparisons of keys performed in the worst case by the Heapsort 


visualized on Figure 31 



Figure 31: The worst-case number C'Heapsortl-^) comparisons of keys by Heapsort. 


Theorem 12.1. For every N >2, 

2(iV - 1) [Ig iV] - 2 r'g^l+^ - 2s2{N) - 62 (iV) + min( [Ig Af], 3) + 5 + c, (184) 
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where c is a binary function on the set of integers defined by: 

1 if N < -4 

0 otherwise. 


c = 


Moreover, ifS<N = then the formula (ISf) simplifies to 


(2]V-3)lg^+3« 


(185) 


Proof of (184) is a direct application of Theorem 11.1 page 91 to the equal¬ 
ities (61) page 30 and ((74j) page 


32 


li N = 2^1®^! then [IgiV] = IgA^, S 2 {N) = 1, e 2 {N) = IgN, and c = 0, 

Ig iV -I- min(lg N, 3) + 5, 


so that (184) simplihes to 

2{N -l)\gN -2N -2 

or to 


N 

{2N - 3) Ig Y min(lg A^, 3). 


If, moreover, N > 8 then min(lgA^, 3) = 3. Hence, in such a case, (184) is 
equal to (185). □ 


The following theorem uses a version of (184) that reduces the impact 
of the non-continuous function ceiling in the formula (184) to a small and 
continuous term e. It allows for easier evaluation of the exact rate of growth 
of its terms as well as comparisons with some other sorting-related formulas 
that are expressible with e. 


96lf = 2 or TV = 4 then the the said formula simplifies to 2{N — 1) Ig ^ 1. 

®^For instance, as indicated in [5], p. 192, the minimum external path length eplrainim) 
in a finite binary tree with m leaves (treated as external nodes) is given by: epl,ain{m) = 
m{\gm + e{m)), where e{x) is the function visualized on Figure 32 and defined in the 
proof of Theorem |12.2[ thus yielding this information-theoretic lower bound on the average 
number of comparisons of keys performed by any decision-tree-sorting algorithm: Ig N\ -f 
e{N\) and this average number of comparisons of keys performed by successful binary 
search: ^'^(lg(TV-f 1)-|-e(A^-|-1))-|-1. 
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The Main Theorem 12.2. For every natural number N > 2, the number 
of comparisons of keys performed in the worst case by the Heapsort on any 
array of size N is equal to: 

N -I 

2{N-l) (Ig —^-h£)-2s2(iV)-e2(iV) + min([lg(A^-l)J,2) + 6 + c, (186) 

where e, given by: 

e = 1 + 6 - 2^ and e = \\g{N - 1)] - Ig (iV - 1), 


is a continuous function of N on the set of reals > 1, with the minimum 
value 0 and and the maximum (supremum if e is restricted to integers) value 

5 = 1 - Ige + Iglge ^ 0.08607133205593420 


S 2 {N) is the sum of all digits of the binary representation of N, e 2 {N) is the 
exponent of 2 in the prime factorization of N, and c is a binary function on 
the set of integers defined by: 


c = 


1 if N < 2rig^l -4 
0 otherwise. 


(187) 


Moreover, if N > 5 then the formula (186) simplifies to: 


2(iV-l)(lg 


N-1 


+ e:) - 2 s 2 {N) - e 2 {N) + 8 + c. 


(188) 


A graph of function c is visualized on Figure 
of function £ is visualized on Figure [3^ 


page 


68 


while a graph 


®®The constant 

1 - Ig e + Ig Ig e « 0.0860713320559342 

has been known as the Erdos constant 6. Erdos used it around 1955 in order to establish 
an asymptotic upper bound for the number M (k) of different numbers in a multiplication 
table of size k x k hy means of the following limit: 


In 


kxk 

M{k) 


k-^oo lnln(fc X k) 


= 6 . 
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0.0860713 



0 0.471234 1 1.47123 2 


Figure 32: Graph of €(x) = 1 + [y] — y — 2^^^^ ^ as a function of y = \gx. It assumes 
the maximum of 1 — Ige + Iglge « 0.0860713320559342 for y = n + 1 — Ig Ig e « n + 
0.4712336270551024 and any integer n. 


Proof, given Theorem 12.1 is elementarily algebraic and most of its com¬ 
ponents had been analyzed in works of others, for instance in |3]. I present 
it here in its entirety for the sake of completeness and reader’s convenience. 


First, I am going to show that for every x > 0, 

x\lgx] — = x(lgx -1- e{x) — 1), 


(189) 


where £ is given by: 

e{x) = 1 + 6 — 2^ and 6 = [Ig x] — Ig x. 

Indeed, substituting definition of 6 to the definition of e, we obtain: 
e{x) = 1 -I- [Ig x] — Ig X — 2^^^ 


or 


or 


or 


2 rig 

Ig X + e{x) - 1 = [Ig x] - 

2 rig 

Ig X -r- e(x) - 1 = [Ig xl-, 

X 

x(lg X -1- e{x) — 1) = x[lg x] — 


which proves (189). 

Now, putting X = TV — 1, one gets 

(Af-i)(ig(iv-i) + £-i) = (iv -1) [Ig (iv -1)] 


Observation that 


(Af - 1) [Ig (iV - 1)1 ^ (jv- l)[lg N] -2ri§^l 


(190) 
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[if N -1 = 2 rig (^-1)1 then [Ig N] = [Ig (iV - 1)] + 1 = Ig (A^ - 1) + 1 and 


both sides of the equality (190) reduce to {N — 1) lg(iV — 1) — + 1, while 


if A^ — 1 7 ^ Id then [Ig A^] = |'lg(A^ — I)] and the equality (190) is 

obviously true] allows one to derive: 

2(Ar-l)[lgAr] _ 2 dgivi+i =2{{N -l)\\gN] -2dg^T) = 

= 2((Ar-l)[lg(Ar-l)] _2r'g(i^-bl) = (iV-l)(lg(Ar-l) + e-l) = 

A^ - 1 

= (Ar-l)(lg^—+e), 


from which one concludes that (184[) is equal to (186) 


□ 


For N > 5, min([lg(A^ — 1)J, 2) + 6 = 8, so (186) is equal to (188). 

Example of a 500-node worst-case array for Heapsort, created by my 
Java program, is included in Appendix A.3 page 110 

Note. Function £ (a:) = 1 + [Ig x] — Ig X — 2 rig 'g ^ visualized on Figure 

has been briefly analyzed in [3]. It assumes the maximum 1 — Ige -1- Iglge ~ 
0.0860713320559342 for Igx = n + Iglge and any integer n, that is, for x 
= 2"’2igig'^ = 2”'Ige. Since In2 and, therefore, Ige are irrational number^^ 
so is X. Therefore, function e restricted to integers never reaches the value 
1 — Ige -|- Iglge. However, one can easily show that lim^^oo £([2"'Igej) = 
1 - Ige-fig lge( 


100 


which makes 1 — Ig e -|- Ig Ig e the supremum of e restricted 


to integers. 


13. Logarithm-based tight upper bounds on 2s2{N) -|- e 2 {N) 


A jumpy function f{N) = 2s2{N) -\-e 2 {N) that appears in formulas ( |184 ) 
and (186) oscillates between 4 and 21g(A^ -|- 1) as A^ ranges between 3 and 
oo, assuming 4 for A^ = 2 dg^J -|- 1 and 21g(77 -|- 1) for N = 2 dg^l — 1 . This 
yields: 

4 < 2s2(A^) + e2iN) < 2 lg(Ar + 1 ). (191) 


®®Here, I only use the fact that Ige does not have finite binary representation. 

^°°Indeed, 0 < 2" Ige — [2"lgeJ < 1, while lim„_,,oo 2" Ige = oo and e{x) is a con¬ 
tinuous function differentiable on its domain minus the countable set of a: = 2^*®^^ 
and {^) = Oj so that the lim„_>oo(e(2"Ige) — e([2”lgej)) = 0. Thus 

lim„^oo e( [2” Ig ej) = lim„^oo e(2" Ig e) = 1 ~ Ig e -f Ig Ig e. 
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Using recurrence relations for 62 (iV) 


e2{N) 


e 2 (f) + l if 2 I N 
0 otherwise 


and S 2 {N) 


S2{N) 


1 if AT = 1 

'52(LtJ) N%2 otherwise 


one can derive a recurrence relation for f{N) 


f{N) = 2 + 


IgiV if iV = 2Lig^J 

f{N — otherwise. 


(192) 


This leads to another (tighter than (191), if < N < 2^^^^'^^ — 1, and 

continuous except for N = + 1 ) upper bound on f{N) shown on 

Figure |33j 



Figure 33: Graphs of 2s2(iV) + e 2 (iV) and its upper bound 21g(A^ — + 1) + 2. 

For every N >2: 

2s2{N) + e2(iV) < 2 lg(iV - + 1) + 2, (193) 

and for every N 7 ^ 21^^®^^: 

2 lg(iV + 1 - 2L1 s(^-^)J) + 2 < 2 lg(Af + 1). 


( 194 ) 
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If — 1 (the case considered in [B]) then 21g(A^ + l —^^-l)+2 = 

21g(iV + l); for all other ^ 2^18^1, 21g(Ar + l-2Lig(^-i)J) + 2 < 21g(Ar + l). 

If 2 < AT = 2^18^1 then 21g(Ar - + 1) + 2 = 21g(Ar + 2) > 


21g(iV + l). 

Let E = {2^ 


k>3}C {2™(2^ + 1) I A;,m > 1}. For 2 < A^ ^ E, (192) 


may be farther expanded to yield even tighter than (|193) npper bonnd 


2 lg(Ar - + 1) + 2 (195) 


on f{N), visnalized on Fignre 34 



Figure 34: Graphs of 2s2{N) + e 2 {N) and its upper bound 21g(A^ — 24s(^ i)J — 
2 Lw- 2 L's(” i)Jj 2 £qj. almost all N. 


The size of the set E^^ = EC {0, ...,N — 1} of exceptions less than N is 
small relative to N. It is clear that for every i E E, S 2 {i) < 2. The nnmber of 
nnmbers i < N with S 2 {i) = 1 is between [IgAfJ and [IgAf], and the nnmber 
of nnmbers i < N with S 2 {i) = 2 is between |([lgAfJ + l)([lgAfJ) and 
Id'lgAf] + l)(|'lgAf]), so that the nnmber of nnmbers i < N with S 2 {i) = 1 
or S 2 {i) = 2 is between |([lg AfJ +3)([lg AfJ) and |(|'lg Af] +3)(|'lg AQL Thns 
the size of E"^^ is 0(log^ N). In particniar, the fnnction given by (195) is an 
npper bonnd on f{N) for almost all A^d^ 


^°^As liniAT-i-oo 


r(rigjvi+3)(rigjvi) 

N 


= 0 . 
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4 


2 


0 


-2 



X 



-4 


Figure 35: A graph of the difference between the npper bonnd 21g(7V — — 

2 Af- 2 Le( )j _|_ _|_ 2 for almost all N and f{N) = 2s2{N) + e 2 {N). Points below the 

N-axis indicate elements of the set E = {2^ | fc > 3} U {2"‘(2^ + 1) | fc,m > 1} for which 
the former is not an upper bound of the latter. 


A graph of the difference between the upper bound (195) and f{N) is 
visualized on Figure [35j 


14. A note about the roots of this work 

Ever since I learned Heapsort, I have always regretted that I had not 
been given a chance to invent it. I thought that a person commemorated 
with the epitaph 

“Here rests He who invented heaps” 

should consider himself lucky. But, at last, I got my chance to precisely 
characterize the worst-case behavior of it, and I couldn’t let it pass. And I 
didn’t. 

I begun looking for a textbook on Analysis of Algorithms with an ex¬ 
act closed-form formula for the worst-case number of comparisons done by 
Heapsort sometime in late 2000s, only to discover that no one seemed to 
know it. For instance, Cormen et al. jSj had some close estimate of that 
number but not the exact formula. I was able to derive such a formula 
for MakeHeap in the Fall 2010. The paper with my derivation took a long 





M. A. Suchenek: A Complete Worst-Case Analysis of Heapsort (MS) 100 


path to print and appeared in the Snmnier 2012, more than a year after 
the same result (albeit with a totally different derivation) was published by 
Paparrizos |8]. I spent parts of the Summers 2012 and 2013 on pinpoint¬ 
ing the formula for the entire Heapsort , and “discovered” a readable copy 


of Kruskal and Weixelbaum’s old report with somewhat sketchy proof of a 
worst-case formula for a special case of iV = 2^^*^^^ — 1 in mid-July 2013 while 
waiting at a service station for a repair of my car and killing time by surhng 
the Internet, after I had hnished all the details of my early proofs of the said 
formula. 


I was amazed by striking similarities between their work and mine. Al¬ 
though we all learned from Knuth’s writings or from writings of those who 
learned from Knuth’s, so any similarities here are not totally coincidental 
even though my knowledge of The Art of Programming is rather spotty and 
often (like in the case the idea of running Heapsort backwards) ex post facto, 
the degree of the said similarities made me wonder if anyone of us who takes 
on certain kind of problems is destined to end up, eventually, on a similar 
path leading to similar results. It does feel, indeed, as if the proof of the for¬ 
mula for the worst-case number of comparisons done by Heapsort was out 
there, somehow independently of our intellectual inquiries, like the gravity 
and the Sun storms, just waiting for somebody to discover it. 


If.l. Comparison with Kruskal-Weixelhaum formulas 

Below are comparisons of my results with formulas published in [6] that 
gave the actual worst-case numbers of comparisons for the special case of 
compete heaps (of size N = 21^^®^^ — 1, that is). As one can see, Kruskal- 
Weixelbaum’s formulas, if extended over all N > 2 happen to give a lower 
bound for the number of comparisons done in the worst case by MakeHeap 
also for other cases of N (Figureleft), but not by RemoveAll (Figure 
left). 

The difference between the actual worst case for RemoveAll and the Kru¬ 
skal-Weixelbaum’s lower bound diverges and its limit superior is -|-oo, as 
Figure [3^ (right) illustrates it. 


took me part of Spring and the Summer 2014 to clean up its derivation and simplify 
the optimality proof. 

103Which was not the intention of their authors. 
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Figure 36: (Left:) Comparison of the actual worst-case for MakeHeap (top line) with 
Kruskal-Weixelbaum’s formula (bottom line). (Right:) The difference between the actual 
worst-case for MakeHeap and Kruskal-Weixelbaum’s lower bound. 


The Kruskal-Weixelbaum’s worst-case formula for RemoveAll coincides 
(the exact value) for iV = — 1 and N ^ + 


with (182) on page 


91 


^(iV) for some ^{N) G (1.4,1.8), where N > 7. 

Interestingly, Kruskal-Weixelbaum’s formulas also give a lower bound for 
the number of comparisons in the worst case for the entire Heapsort (Fig¬ 
ure!^, left) although they overestimate that number for RemoveAll for some 


^ ^ — 1, for instance, for N = 21^^^^^^ and even if c = 1, for instance, 

for N = -|- 1. One could speculate that this was a reason why they 

stopped short of deriving the formulas for arbitrary N > 2, not just for 
TV = 2^1®^! - 1. 

The difference between the actual worst case for Heapsort and the Kru¬ 
skal-Weixelbaum’s lower bound diverges and its limit superior is -|-oo, as 
Figure [M| (right) illustrates it. 

Moreover, Kruskal-Weixelbaum proof of decomposition of the worst-cases 
of Heapsort (Theorem 2 and Remark 1 in |6]) works only for iV = 2^^'^'^^ — 1. 
Unlike my program unFixHeap on Figure [4| page [Mf the worst-case input 
generator for Makeheap presented in [^j works for input sizes N = — 1 

but not for other sizes, except, incidentally. For instance, it doesn’t work 
for = 12 as their algorithm (Figure 2 in [6]) does not generate a worst- 
case array (e.g., one visualized on Figure]^ page 31) for the worst-case heap 
visualized on Figurepage IT As the hrst step, their algorithm will “unsift” 
the root 12 at index 1 of the said heap all the way down to the index 12 
of the leaf 1, which happens to be the only node of the last level without a 
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Figure 37: (Left:) Comparison of the actual worst-case for RemoveAll (the crooked line) 
with Kruskal-Weixelbaum’s formula (the smooth line). The latter overestimates the fromer 
for N = 4,5,8,9,16 and 17. (Right:) The difference between the actual worst-case for 
RemoveAll and Kruskal-Weixelbaum’s lower bound. It is negative for for iV = 4, 5, 8, 9, 
16, 17, 32, 33, 64, 65, 128, 129, 256 and 257. 


sibling. As a result, the corresponding “sift” will, by the equality (85) page 38 
perform 2[lg 12j — 1 comparisons while the maximum, given by (89) and the 


right-hand side of the inequality (88) page 38 is [lgl2j -|- [IgllJ = 2[lgl2j. 
Hence the “reverse heap” created of the said heap by their algorithm is not 
a worst-case array for the MakeHeap. 

The PROCEDURE UNSIFT(S, BOUND) of their algorithm is functionally very 
close to Java method PullDown(i, j) of Fig. on page 2( 

BOUND for i and S for j. Should they use “the leftmost leaf 
rooted by P” rather than “node containing smallest value in the 
tree rooted by P”, their algorithm would be functionally equivalent to my 
program and would correctly generate worst-case arrays for any N > 2 for 
MakeHeap. 


substituting 
in the tree 


So, how close were they to discovering the general formulas for any N >21 

The formula for the worst-case number of comparisons for RemoveAll 
that they derived in j6] (Theorem 3) for N = — 1 is equivalent to this 

one: 

2{N - 1) lg{N + 1) - 4(Ar - 1) + min(lg(Ar + i) _ 2). 

Should they try to derive a formula for N = 2^^®'^-! J-l, insteacf^ they would 


Another possibility would be playing with the strategy z{N) of Example 
N = - 3. 


10.10 


for 
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Figure 38: (Left:) Comparison of the actual worst-case for Heapsort (top line) with 
Kruskal-Weixelbaum’s formula (bottom line). (Right:) The difference between the actual 
worst-case for Heapsort and Kruskal-Weixelbaum’s lower bound. 


obtain a formula equivalent to this one: 


2{N - 1) lg(iV - 1) - 4(Af - 1) + 4 + min(lg(iV - 1), 2). 


(196) 


Interpolating the hrst part 2(iV — 1) lg(Af — 1) — 4(iV — 1) -t-4 of (196) between 
the points iV = 21-^®^-! -|- 1, as it is visualized on Figure they could “guess” 
the exact value of the hrst part of the payoff Ppar(-^ — 1) for the hrst N — 1 
moves of the strategy par, that is, 

2{N - 1) [lg(iV - 1)J - + 4. 


‘Guessing” that lg(iV — 1) in min(lg(iV — 1),2) in (196) should really be 


[lg(Af — 1)J would allow them to arrive at the correct formula for Ppar(Af — 1). 
After that, the only missing part of the actual formula for C^'^l-^-^{N) is the 
function c dehned by the equation (183) on page 91, which does not seem 
like an obvious “guess”. In order to compute c, control of the index of node 
4, which Kruskal-Weixelbaum did not do, in construction of a general worst- 


case heap, as indicated by the Worst-case Heap Characterization Lemma 10.8 
page[^ seems critical. And, of course, the proof that all the above are correct 
guesses would be nedded, too, and that would be, perhaps, the hardest part. 


Any “guessing” of the formula (74) on page 32 


2N - 2s 2{N) - e2{N) 


for C'^SeHeap(^) froHi 2N — 2 lg(Af -|- 1) (formula derived in [6], correct for 
N = 2^^^—1) or from 2N — 4 (correct for Af = -|- 1) by interpolation 
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Figure 39: The function 2{N — l)[lg(fV— 1)J — 24s(^ i)J+2 _|_4 line) interpolates the 
function 2{N — 1) lg(A^ — 1) — 4(7V —1) + 4 (bottom line) between the points iV = 2L*8^J +1. 


or otherwise, seems out of the question if one takes into account the jumpy 
behavior of 2s2{N) + 62 (iV) as analyzed in Section 13 and visualized on 


Figure [33] page 97 


For 15 < iV = — 1, Kruskal-Weixelbaum’s 35-years olc^^ formula 

gives a lesser (better, that is) value than one that I somewhat hastiljj^ 
called, in the Abstract and in the opening sentence of Section 9 of the 
“best-known upper bound” on the number of comparisons of Heapsort. 


the time of publication of this paper. 

After “exhaustively” testing it for all permutations of up to 10 first positive integers; 
the test for all permutations of 13 would let me realize my mistake just in four months or 
so of running it on my laptop computer. 
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APPENDIX 
Appendix A. Examples 

Appendix A.l. Example of construction of a worst-case 12-element heap us¬ 
ing strategy win(12) 

The following is a compilation of excerpts from an output of my Java 
program that (among other things) visualizes construction of a worst-case 
heap of 12 elements of Figures [T||^ and on pages [T7| through [M] using the 
strategy win(12). It shows a sequence of 12 heaps (a game), each printed 
level-by-level, and the first 11 pull downs of win(12) that are applied to the 
the first 11 of these heaps. All the pull downs are lossless. There is no 
lossless pull down after the last one in this sequence. The only two valid 
moves that are applicable to the last heap are pull down 1, which pulls down 
a leaf without a sibling, and pull down 2, which pulls down the parent of the 
former. They both are lossy because, by the equality (85) on page 38, they 
yield a credit 

2 Llgl2j -1= Llgl2j + [IgllJ -1 
and this is less than the maximum credit 

[lgl2j + [IgllJ 


given by (89) and the right-hand side of the inequality (88) on page 38 for 


pulling down a node in a 12-element heap. (After all, by virtue of Theorem 


10. 9| on page [83] substituting 3 for m, every sequence of more than 11 con¬ 
secutive pull downs the first of which is applied to the 1-element heap must 
be lossy.) 


Level 0: 1 

H[l] -> H[2] 

Level 0; 2 

Level 1; 1“ 


H[2] 


> H[3] 
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Level 0: 3 

Level 1: 2~1 

H[3] -> H[4] 

Level 0: 4 

Level 1: 2 3 

Level 2: 1" 

H[4] -> H[5] 

Level 0: 5 

Level 1: 4 3 

Level 2; 2"1 

H[4] -> H[6] 

Level 0: 6 

Level 1: 5 3 

Level 2: 4"! 2 ~ 

H[5] -> H[7] 

Level 0: 7 

Level 1: 6 3 

Level 2: 4-5 2"! 

H[7] -> H[8] 

Level 0: 8 

Level 1: 6 

Level 2: 4 5 

Level 3: 1" 

H[4] -> H[9] 

Level 0: 9 

Level 1: 8 

Level 2: 6 5 

Level 3: 1-4 

H[8] -> H[10] 

Level 0: 

Level 1: 9- 


—7 

2—3 


—7 

2—3 


10 
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Level 2; 8 5 2 3 

Level 3: 6 4 1 

H[9] -> H[ll] 


Level 0: 11 

Level 1: 10 7 

Level 2: 9 5 2 3 

Level 3: 6 8 1 4 


H[10] -> H[12] 

Level 0: 12 

Level 1: 11.7 

Level 2: 9 10 2 3 

Level 3: 6—8 5—4 I" 

The strategy was the first 11 pull downs of win(12): 
<1, 1, 1, 1, 2, 1, 1, 4, 1, 4, 1> 

BUILD SUCCESSFUL (total time; 0 seconds)} 


Appendix A.2. Example of construction of the last level a worst-case 30- 
element heap using strategy win(30) 

The following is an annotated compilation of excerpts from an output of 
my Java program that visualizes construction of the last level of a worst-case 
heap of 30 elements. It shows a sequence of 16 heaps (a subgame), each 
printed level-by-level, and the hrst 15 pull downs of win(30) that are applied 
to the the hrst 15 of these heaps. All the pull downs, except for 14th pull 
down that incurs a loss of 1 credit, are lossless. 

Win(30) - annotated excerpt from the output of my Java program 


Level 0: 15 

Level 1: 12-14 

Level 2: 9-11 13-3 

Level 3: 4—6 10—5 7—8 2—1 


Par ended. Greedy begins here. 


Level 0: 
Level 1: 


12 


16 


15 
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Level 2; 9-11 13-14 

Level 3: 4- 6 10-5 7- 8 2-3 

Level 4: 1 

Level 0; 17 

Level 1: 16-15 

Level 2: 12-11 13-14 

Level 3: 9- 6 10-5 7- 8 2-3 

Level 4: 1 4 

Level 0: 18 

Level 1: 17-15 

Level 2: 16.11 13".14 

Level 3; 12. 6 10".5 7. 8 2".3 

Level 4: 9 4 1 

Level 0: 19 

Level 1: 18-15 

Level 2: 17-11 13-14 

Level 3: 16- 6 10-5 7- 8 2-3 

Level 4: 9—12 1—4 

Level 0: 20 

Level 1: 19-15 

Level 2: 18.11 13.14 

Level 3: 16.17 10.5 7. 8 2".3 

Level 4: 9—12 6—4 1 — 

Level 0: 21 

Level 1: 20-15 

Level 2: 19-11 13-14 

Level 3: 16-18 10-5 7- 8 2-3 

Level 4: 9—12 6—17 1—4 

Level 0: 22 

Level 1: 21-15 

Level 2: 19-20 13-14 

Level 3: 16".18 11.5 7. 8 2".3 

Level 4: 9—12 6—17 10—4 1 — 

Level 0: 23 

Level 1: 22-15 

Level 2: 19-21 13-14 

Level 3: 16-18 20-5 

Level 4: 9—12 6—17 10—11 1—4 


7 


—8 


2 " 


—3 
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Level 0: 24 

Level 1: 23-15 

Level 2: 19-22 13-14 

Level 3; 16.18 20.21 7.8 2.3 

Level 4: 9—12 6—17 10"11 5—4 1— 

Level 0: 25 

Level 1: 24-15 

Level 2: 19-23 13-14 

Level 3: 16-18 20-22 7-8 2-3 

Level 4: 9—12 6—17 10—11 5—21 1—4 

Level 0; 26 

Level 1: 24-25 

Level 2: 19-23 15-14 

Level 3: 16-18 20-22 13-8 2-3 

Level 4: 9—12 6—17 10—11 5—21 7—4 1 — 

Level 0: 27 

Level 1: 24-26 

Level 2: 19-23 25-14 

Level 3; 16-18 20-22 15-8 2-3 

Level 4: 9—12 6—17 lO"11 5—21 7—13 1—4 

Level 0: 28 

Level 1; 24-27 

Level 2: 19-23 26-14 

Level 3: 16-18 20-22 15-25 2-3 

Level 4: 9—12 6—17 10—11 5—21 7—13 8—4 1 — 


No lossless pull down is possible at this point 


Level 0; 29 

Level 1: 24-28 

Level 2; 19-23 26-27 

Level 3: 16-18 20-22 15-25 14-3 

Level 4; 9—12 6—17 10—11 5—21 7—13 8—4 2—1 

Level 0: 30 

Level 1: 24-29 

Level 2: 19-23 26-28 

Level 3: 16-18 20-22 15-25 27-3 

Level 4: 9—12 6—17 10—11 5—21 7—13 8—4 14 —1 2" 
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Appendix A.3. Example of a 500-element worst-case input for Heapsort and 
heap for RemoveAll 

Below is a 500-element worst-case input array A for the Heapsort gener¬ 
ated by my Java program unMakeHeap run on the worst-case heap iJsoo for 
RemoveAll. iJsoo had been created by my Java implementation of win(iV), 
and is shown below, after A. 

65, 133, 7, 192, 10, 128, 14, 260, 17, 160, 5, 223, 8, 224, 29, 269, 24, 144, 6, 
191, 11, 176, 21, 388, 13, 208, 25, 254, 15, 240, 60, 286, 39, 136, 9, 287, 18, 
152, 36, 324, 20, 168, 40, 351, 22, 184, 44, 397, 16, 200, 32, 415, 26, 216, 52, 
452, 28, 232, 56, 479, 30, 248, 123, 319, 70, 132, 12, 271, 33, 140, 67, 292, 35, 
148, 71, 303, 37, 156, 75, 333, 23, 164, 47, 335, 41, 172, 83, 356, 43, 180, 87, 
367, 45, 188, 91, 414, 27, 196, 55, 399, 49, 204, 99, 420, 51, 212, 103, 431, 53, 
220, 107, 461, 31, 228, 63, 463, 57, 236, 115, 484, 59, 244, 119, 494, 1, 249, 2, 
384, 34, 256, 19, 263, 48, 264, 130, 276, 66, 272, 134, 279, 68, 280, 138, 301, 
38, 288, 78, 295, 72, 296, 146, 308, 74, 304, 150, 311, 76, 312, 154, 350, 42, 
320, 86, 327, 80, 328, 162, 340, 82, 336, 166, 343, 84, 344, 170, 365, 46, 352, 

94, 359, 88, 360, 178, 372, 90, 368, 182, 375, 92, 376, 186, 447, 50, 255, 102, 

391, 64, 392, 194, 404, 98, 400, 198, 407, 100, 408, 202, 429, 54, 416, 110, 
423, 104, 424, 210, 436, 106, 432, 214, 439, 108, 440, 218, 478, 58, 448, 118, 

455, 112, 456, 226, 468, 114, 464, 230, 471, 116, 472, 234, 493, 62, 480, 126, 

487, 120, 488, 242, 495, 122, 61, 246, 251, 124, 250, 3, 500, 96, 159, 142, 259, 
79, 258, 257, 268, 129, 262, 261, 267, 131, 266, 265, 285, 69, 270, 141, 275, 

135, 274, 273, 284, 137, 278, 277, 283, 139, 282, 281, 318, 73, 158, 149, 291, 

143, 290, 289, 300, 145, 294, 293, 299, 147, 298, 297, 317, 77, 302, 157, 307, 

151, 306, 305, 316, 153, 310, 309, 315, 155, 314, 313, 383, 81, 174, 165, 323, 

95, 322, 321, 332, 161, 326, 325, 331, 163, 330, 329, 349, 85, 334, 173, 339, 

167, 338, 337, 348, 169, 342, 341, 347, 171, 346, 345, 382, 89, 190, 181, 355, 

175, 354, 353, 364, 177, 358, 357, 363, 179, 362, 361, 381, 93, 366, 189, 371, 

183, 370, 369, 380, 185, 374, 373, 379, 187, 378, 377, 499, 97, 206, 197, 387, 

111, 386, 385, 396, 193, 390, 389, 395, 195, 394, 393, 413, 101, 398, 205, 403, 

199, 402, 401, 412, 201, 406, 405, 411, 203, 410, 409, 446, 105, 222, 213, 419, 

207, 418, 417, 428, 209, 422, 421, 427, 211, 426, 425, 445, 109, 430, 221, 435, 

215, 434, 433, 444, 217, 438, 437, 443, 219, 442, 441, 498, 113, 238, 229, 451, 

127, 450, 449, 460, 225, 454, 453, 459, 227, 458, 457, 477, 117, 462, 237, 467, 

231, 466, 465, 476, 233, 470, 469, 475, 235, 474, 473, 497, 121, 253, 245, 483, 
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239, 482, 481, 492, 241, 486, 485, 491, 243, 490, 489, 496, 125, 252, 4, 247. 

Below is a 500-nodes worst-case heap H^qq, shown in the left-to-right level- 
by-level order, for RemoveAll. It was created by the hrst 499 moves of (my 
Java implementation) of win(500). i^soo is equal to the heap constructed by 
MakeHeap run on the input array A visualized above. 

500, 384, 499, 319, 383, 447, 498, 286, 318, 350, 382, 414, 446, 478, 497, 269, 

285, 301, 317, 333, 349, 365, 381, 397, 413, 429, 445, 461, 477, 493, 496, 260, 

268, 276, 284, 292, 300, 308, 316, 324, 332, 340, 348, 356, 364, 372, 380, 388, 

396, 404, 412, 420, 428, 436, 444, 452, 460, 468, 476, 484, 492, 495, 251, 192, 

259, 263, 267, 271, 275, 279, 283, 287, 291, 295, 299, 303, 307, 311, 315, 191, 

323, 327, 331, 335, 339, 343, 347, 351, 355, 359, 363, 367, 371, 375, 379, 223, 

387, 391, 395, 399, 403, 407, 411, 415, 419, 423, 427, 431, 435, 439, 443, 254, 

451, 455, 459, 463, 467, 471, 475, 479, 483, 487, 491, 494, 247, 249, 250, 133, 

159, 256, 258, 132, 262, 264, 266, 136, 270, 272, 274, 140, 278, 280, 282, 144, 

158, 288, 290, 148, 294, 296, 298, 152, 302, 304, 306, 156, 310, 312, 314, 160, 

174, 320, 322, 164, 326, 328, 330, 168, 334, 336, 338, 172, 342, 344, 346, 176, 

190, 352, 354, 180, 358, 360, 362, 184, 366, 368, 370, 188, 374, 376, 378, 128, 

206, 255, 386, 196, 390, 392, 394, 200, 398, 400, 402, 204, 406, 408, 410, 208, 

222, 416, 418, 212, 422, 424, 426, 216, 430, 432, 434, 220, 438, 440, 442, 224, 

238, 448, 450, 228, 454, 456, 458, 232, 462, 464, 466, 236, 470, 472, 474, 240, 

253, 480, 482, 244, 486, 488, 490, 248, 252, 61, 246, 123, 124, 2, 3, 65, 96, 34, 
142, 70, 79, 19, 257, 39, 129, 48, 261, 12, 131, 130, 265, 24, 69, 66, 141, 33, 
135, 134, 273, 9, 137, 68, 277, 67, 139, 138, 281, 17, 73, 38, 149, 35, 143, 78, 
289, 18, 145, 72, 293, 71, 147, 146, 297, 6, 77, 74, 157, 37, 151, 150, 305, 36, 
153, 76, 309, 75, 155, 154, 313, 10, 81, 42, 165, 23, 95, 86, 321, 20, 161, 80, 
325, 47, 163, 162, 329, 11, 85, 82, 173, 41, 167, 166, 337, 40, 169, 84, 341, 83, 
171, 170, 345, 5, 89, 46, 181, 43, 175, 94, 353, 22, 177, 88, 357, 87, 179, 178, 
361, 21, 93, 90, 189, 45, 183, 182, 369, 44, 185, 92, 373, 91, 187, 186, 377, 7, 
97, 50, 197, 27, 111, 102, 385, 16, 193, 64, 389, 55, 195, 194, 393, 13, 101, 98, 
205, 49, 199, 198, 401, 32, 201, 100, 405, 99, 203, 202, 409, 8, 105, 54, 213, 
51, 207, no, 417, 26, 209, 104, 421, 103, 211, 210, 425, 25, 109, 106, 221, 53, 
215, 214, 433, 52, 217, 108, 437, 107, 219, 218, 441, 14, 113, 58, 229, 31, 127, 
118, 449, 28, 225, 112, 453, 63, 227, 226, 457, 15, 117, 114, 237, 57, 231, 230, 
465, 56, 233, 116, 469, 115, 235, 234, 473, 29, 121, 62, 245, 59, 239, 126, 481, 
30, 241, 120, 485, 119, 243, 242, 489, 60, 125, 122, 4, 1. 

The 499 pull downs that produced the above heap were: 

( 1 , 1 , 1 , 1 , 2 , 1 , 1 , 2 , 1 , 2 , 2 , 1 , 2 , 1 , 1 , 2 , 1 , 2 , 2 , 1 , 2 , 1 , 2 , 1 , 2 , 1 , 2 , 1 , 2 , 
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1, 

1, 2, 

1, 

2, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 1, 

1, 

2, 

1, 

2, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

1, 2, 

1, 

2, 

2, 

1, 

2, 

1, 

2, 1, 

2, 1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 1, 

2, 

1, 

2, 

1, 

2, 

1, 

1,4, 

1,4, 

1,4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1,4, 

1,4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1,4, 

1,4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1,4, 

1,4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1,4, 

1,4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1,4, 

1,4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1,4, 

1,4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1,4, 

1,4, 

1, 

4, 

1, 

4, 

1, 

4, 

1,4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 4, 

1, 

4, 

1, 

4, 

1, 

4, 

1, 4, 

1,4, 

1) 










It took MakeHeap 986 comparisons to construct H^qq from the worst-case 
array A shown above, and RemoveAll 6,967 comparisons to deconstruct it, 
for a total of 7,953 comparisons to Heapsort the input array A. The total 
time my Java program took for creation of the input array A, which included 
creation of heap and sorting it, was less than 1 second under Netbeans 
IDE 6.9 on a Dell Lattitude E5510 laptop computer with Intel® Core™ i5 
2.40GHz processor, running Ubuntu 10.10 operating system. With all the 
diagnostics, dumping all the subheaps hxed and constructed (the time spent 
on which was 0(iV^ logiV)), and ornamental overhead, the total time was 24 
seconds 


Appendix B. Hereditary worst-case heaps 


Hereditary worst-case heaps for RemoveAll () are dehned as worst-case 
heaps whose all residua are also worst-case. For example, every worst-case 
heap on 12 or less nodes is hereditary worst-case. 


For any hereditary worst-case heap of size A, its creative sequence yields 
a substrategy sub that is optimal for any i with 1 < i < N (not just for 
N — 1). The loss of credit function Asub(^) for its moves i coincides with 
the delayed loss function X*{i) dehned by (129) page 57 for all 1 < i < A, 
and, therefore, is minimal for every move. Thus the creative sequence of any 
hereditary worst-case heap yields a greedy substrategy. 
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RemoveAllRec: Remove of 22 took 8 comps. 

RemoveAllRec: 22-eT.ement heap deconstructed in 111 comps. 

RemoveAllRec: Theoretic worst-case number of comps = 111 
»> Hereditary worst-case 22-element heap level - by-1 e vel: 

Level 0: 22 

Level 1: 

Level 2: 1 

Level 3: i8''''''''''''19 

Level 4: 16''''15 17^''^6 4 

Creative sequence: <1, 1, 1, 1, 2, 1, 1, 1, 4, 1, 1, 1, 2, 1, 4, 6, 4, 6, 8, 9, 4> 

»> Number of comparisons while sorting = 148 
»> Theoretic worst-case number of comps = 148 
BUILD SUCCESSFUL (total time: 0 seconds) 


'14 


13 ^ 


/v/vyk/v/vyvys i 


Figure B.40: A hereditary worst-case heap of 22 nodes produced by creative sequence 
(1,1,1,1, 2,1,1,1,4,1,1,1,2,1,4, 6,4,6,8,9,4) whose 12th move looses 1 credit to the up¬ 
per bound [lgl2j -H [IgllJ =6 and all other moves are lossless. 


For example, one can take any 12-element worst-case heap (one created by 
win(12) will do) and apply to it any greedy strategy of pnll-downs. With rel¬ 
atively straightforward experimentations, one can find this way a 20-element 
worst-case heap [20,19,15,18,10,13,14,16,17,4, 5, 7,12,2, 3, 9,11, 8, 6,1] cre¬ 
ated with a greedy substrategy (1,1,1,1, 2,1,1,1,4,1,1,1, 2,1,1, 6,1, 6,1) 
whose 12th move looses 1 credit to the upper bound [Ig 12j -|- [Ig 11J = 
6 and all other moves are lossless. 


Unfortunately, one can only go so far playing greedy as the largest hered¬ 
itary worst-case heap has only 22 nodes. For instance, a hereditary worst- 
case heap of 22 nodes is visualized on Figure [B.40[ Given the function credit 


cr(i, k) defined by (32) page 32 and the discussion of cases of maximal credit 
on the following pages, one can write a simple Java program (as I did) that 
generates all greedy substrategies and, by the Mapping Theorem |6 .1 1 page [37l 

finds all 1017 hereditary worst-case heaps by means of pre-order traversal of 

FoTl 


a tree of their creative sequences An excerpt of an output of such a pro¬ 


gram is shown on Figure B.41 The complete output has been posted at: 


\protect\vrule widthOpt\protect\href{http://esc.esudh.edu/suchenek/Papers/Hereditary_wors 


lOTpor instance, the above mentioned 20-node hereditary worst-case heap appears as 
698 on the list generated by my Java program. 
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I can't find any herGditary worst-case heap of size 23 
Stats: the total number of hereditary worst-case heaps is 1017 
1 hereditary worst-case heap of size 1 

1 hereditary worst-case heap of size 2 

2 hereditary worst-case heaps of size 3 

3 hereditary worst-case heaps of size 4 
6 hereditary worst-case heaps of size 5 
6 hereditary worst-case heaps of size 6 

4 hereditary worst-case heaps of size 7 

18 hereditary worst-case heaps of size 8 

36 hereditary worst-case heaps of size 9 

48 hereditary worst-case heaps of size 10 

32 hereditary worst-case heaps of size 11 

12 hereditary worst-case heaps of size 12 

12 hereditary worst-case heaps of size 13 

12 hereditary worst-case heaps of size 14 

8 hereditary worst-case heaps of size 15 
56 hereditary worst-case heaps of size 16 
112 hereditary worst-case heaps of size 17 

144 hereditary worst-case heaps of size 18 

96 hereditary worst-case heaps of size 19 
120 hereditary worst-case heaps of size 20 

192 hereditary worst-case heaps of size 21 

96 hereditary worst-case heaps of size 22 
0 hereditary worst-case heaps of size 23 
Control sum = 1017 

BUILD SUCCESSFUL (total time: 3 seconds) 


Figure B.41: A statistics of all hereditary worst-case heaps produced by a Java program. 
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